A561L fasta
A561L fasta

>gi|9632119|ref|NP_048917.1| hypothetical protein [Paramecium bursaria Chlorella virus 1]

MADVEVPINVITEINNLNELNNVNTGNEASGGISRWLIYTIIVLVVAAVVGLVIFIVKKVNNKKRESGQI
IQEVSQVLTENVDKKVANDLLDVALRARTEQAQLEANSATSVQLAAVGKISQSDADMAIKSAADAKIKAE
QANIEAAKAIENIRKQELDNANKAVAMATTAAAAISATYQDKSKDIIDNKLREAEETYKQVVQQRQLAEK
EYNASLATRRQAERNVYNKKVTEANETIKSVKKTEDVLKVLDDKIKQAKKATDDFKKKKQADKNK
PPASK
PSPGPKPAPKPSPGPKPAPKPSPGPKPSPG
PSACPQFQTRDSKGQCVCDPRQGVTWDGKSCVCDMKNGWN
WDGKKCVKGACPQFQTRDSKGQCVCDPKQGVTWDGKGCVCDMKNGWNWDGKKCVKS
GGGGGGGNVSGTIT
LMSNEETAVLNKGTYPGQGGKRYNRQWNVPGNMSDSCLLEFDIFFPSNFWFGCQGKIGGFFLSRPGQRGV
ASGCAKPGKRTGASYRVMWGGTTYKNGKRVGRDGSGVYPYLYFDDSTNSKQIPKLKQVEDCGHSIMVEEF
SRSIKRNAWNNIKIGLKLNTIGQRNGLIYFEVNGQKQTQDQVMWTSSSDFNIKYVIFGTFYGGCTGQNIN
QIPNTFVKYKNVKISKWSP

649AA, pI=9.43

Position 269679 - 271628

The blue region is absolutely identical to a segment of A565R. A565R has no other regions of homology to A561L. This region also resembles teneurins which are transmembrane proteins. It also contains repeats. The C-terminus is homologous to alginate lyases.

The red regions are spacers.

The green region is annotated as mitofilin (a mitochondrial inner membrane protein), but blastp finds the mitofilin domain only with seqences containing all the colored regions.

N-terminal homologs are found in all chloroviruses.

Gene + 10 fasta

>gi|145309287:c271638-269669 Paramecium bursaria Chlorella virus 1, complete genome ATGGCCGATGTTGAGGTTCCTATAAACGTGATTACTGAAATTAATAATCTGAACGAGTTA AACAATGTCAATACCGGCAACGAAGCGTCCGGTGGCATAAGCCGCTGGTTAATATATACTATAATTGTAC TTGTCGTCGCAGCTGTTGTTGGGTTGGTAATTTTCATTGTTAAGAAGGTAAACAATAAAAAACGCGAATC TGGGCAAATTATACAAGAAGTCTCGCAAGTTTTAACAGAAAATGTTGATAAAAAGGTTGCAAACGACCTT CTCGACGTTGCTTTACGAGCTCGAACCGAACAGGCACAATTAGAAGCGAATTCAGCAACAAGTGTCCAAC TCGCCGCCGTTGGTAAAATATCTCAGAGCGACGCGGATATGGCAATCAAATCAGCAGCAGATGCTAAAAT AAAGGCAGAACAAGCGAATATAGAAGCAGCAAAAGCTATAGAAAATATAAGGAAACAAGAGCTTGACAAT GCAAATAAGGCAGTTGCCATGGCGACTACTGCTGCTGCTGCAATTAGTGCCACTTATCAAGATAAATCAA AAGATATAATTGATAACAAACTTAGAGAAGCGGAAGAAACTTACAAACAAGTCGTTCAACAACGCCAACT TGCAGAAAAAGAATATAACGCGAGTTTAGCCACGCGGAGACAAGCGGAACGTAATGTGTACAACAAAAAA GTTACAGAAGCCAATGAAACAATAAAATCAGTTAAAAAAACGGAAGATGTGCTAAAGGTGTTAGATGATA AAATAAAACAAGCAAAAAAAGCCACAGATGATTTCAAGAAAAAAAAACAGGCTGACAAAAATAAACCACC TGCTTCAAAGCCCTCACCGGGTCCTAAGCCCGCTCCTAAACCATCACCGGGTCCTAAACCCGCTCCTAAA CCATCACCGGGTCCTAAACCATCCCCTGGACCGAGCGCGTGCCCTCAGTTCCAAACTCGCGATTCTAAGG GCCAGTGTGTTTGTGACCCCAGACAAGGCGTCACATGGGATGGTAAGAGTTGTGTATGCGACATGAAGAA CGGGTGGAACTGGGACGGAAAGAAGTGTGTAAAAGGTGCATGTCCCCAATTCCAAACCCGCGATTCTAAG GGTCAATGTGTTTGCGATCCCAAACAAGGGGTGACATGGGACGGAAAGGGCTGTGTTTGCGACATGAAGA ACGGGTGGAACTGGGACGGAAAGAAGTGTGTGAAATCGGGTGGTGGAGGTGGCGGTGGAAATGTAAGTGG GACGATAACATTAATGAGCAATGAAGAAACTGCGGTATTAAACAAAGGGACATACCCAGGACAGGGTGGG AAACGTTACAATCGTCAGTGGAACGTCCCAGGCAATATGTCCGATAGTTGTTTATTAGAATTTGACATTT TCTTCCCCAGTAACTTTTGGTTCGGGTGCCAGGGGAAAATTGGAGGATTCTTCTTATCAAGACCTGGACA GAGAGGAGTCGCTTCGGGTTGTGCAAAACCTGGAAAGCGAACTGGTGCGAGTTATCGCGTGATGTGGGGA GGTACTACTTATAAAAACGGTAAACGCGTTGGTCGTGACGGAAGTGGAGTGTATCCATATTTATACTTTG ACGATTCTACGAATTCTAAACAGATCCCTAAATTGAAACAGGTAGAAGACTGTGGGCACAGCATCATGGT GGAAGAATTTTCGAGGAGTATCAAGAGAAATGCGTGGAATAACATCAAAATTGGTCTTAAATTAAATACG ATCGGACAAAGAAACGGGTTGATTTATTTCGAAGTGAATGGTCAAAAGCAAACACAAGACCAAGTTATGT GGACGTCTAGTTCGGATTTCAATATAAAATACGTCATATTTGGAACATTCTATGGCGGCTGCACAGGACA AAATATAAATCAAATTCCTAATACGTTTGTAAAGTACAAAAACGTAAAAATTTCTAAATGGTCGCCCTAA

C-terminal

GGAAATGTAAGTGG GACGATAACATTAATGAGCAATGAAGAAACTGCGGTATTAAACAAAGGGACATACCCAGGACAGGGTGGG AAACGTTACAATCGTCAGTGGAACGTCCCAGGCAATATGTCCGATAGTTGTTTATTAGAATTTGACATTT TCTTCCCCAGTAACTTTTGGTTCGGGTGCCAGGGGAAAATTGGAGGATTCTTCTTATCAAGACCTGGACA GAGAGGAGTCGCTTCGGGTTGTGCAAAACCTGGAAAGCGAACTGGTGCGAGTTATCGCGTGATGTGGGGA GGTACTACTTATAAAAACGGTAAACGCGTTGGTCGTGACGGAAGTGGAGTGTATCCATATTTATACTTTG ACGATTCTACGAATTCTAAACAGATCCCTAAATTGAAACAGGTAGAAGACTGTGGGCACAGCATCATGGT GGAAGAATTTTCGAGGAGTATCAAGAGAAATGCGTGGAATAACATCAAAATTGGTCTTAAATTAAATACG ATCGGACAAAGAAACGGGTTGATTTATTTCGAAGTGAATGGTCAAAAGCAAACACAAGACCAAGTTATGT GGACGTCTAGTTCGGATTTCAATATAAAATACGTCATATTTGGAACATTCTATGGCGGCTGCACAGGACA AAATATAAATCAAATTCCTAATACGTTTGTAAAGTACAAAAACGTAAAAATTTCTAAATGGTCGCCCTAA

Reverse complement

TTAGGGCGACCATTTAGAAATTTTTACGTTTTTGTACTTTACAAACGTATTAGGAATTTGATTTATATTT TGTCCTGTGCAGCCGCCATAGAATGTTCCAAATATGACGTATTTTATATTGAAATCCGAACTAGACGTCC ACATAACTTGGTCTTGTGTTTGCTTTTGACCATTCACTTCGAAATAAATCAACCCGTTTCTTTGTCCGAT CGTATTTAATTTAAGACCAATTTTGATGTTATTCCACGCATTTCTCTTGATACTCCTCGAAAATTCTTCC ACCATGATGCTGTGCCCACAGTCTTCTACCTGTTTCAATTTAGGGATCTGTTTAGAATTCGTAGAATCGT CAAAGTATAAATATGGATACACTCCACTTCCGTCACGACCAACGCGTTTACCGTTTTTATAAGTAGTACC TCCCCACATCACGCGATAACTCGCACCAGTTCGCTTTCCAGGTTTTGCACAACCCGAAGCGACTCCTCTC TGTCCAGGTCTTGATAAGAAGAATCCTCCAATTTTCCCCTGGCACCCGAACCAAAAGTTACTGGGGAAGA AAATGTCAAATTCTAATAAACAACTATCGGACATATTGCCTGGGACGTTCCACTGACGATTGTAACGTTT CCCACCCTGTCCTGGGTATGTCCCTTTGTTTAATACCGCAGTTTCTTCATTGCTCATTAATGTTATCGTC CCACTTACATTTCCACCGCCACCTCCACCACCCGATTTCACACACTTCTTTCCGTCCCAGTTCCACCCGT TCTTCATGTCGCAAACACAGCCCTTTCCGTCCCATGTCACCCCTTGTTTGGGATCGCAAACACATTGACC CTTAGAATCGCGGGTTTGGAATTGGGGACATGCACCTTTTACACACTTCTTTCCGTCCCAGTTCCACCCG TTCTTCATGTCGCATACACAACTCTTACCATCCCATGTGACGCCTTGTCTGGGGTCACAAACACACTGGC CCTTAGAATCGCGAGTTTGGAACTGAGGGCACGCGCTCGGTCCAGGGGATGGTTTAGGACCCGGTGATGG TTTAGGAGCGGGTTTAGGACCCGGTGATGGTTTAGGAGCGGGCTTAGGACCCGGTGAGGGCTTTGAAGCA GGTGGTTTATTTTTGTCAGCCTGTTTTTTTTTCTTGAAATCATCTGTGGCTTTTTTTGCTTGTTTTATTT TATCATCTAACACCTTTAGCACATCTTCCGTTTTTTTAACTGATTTTATTGTTTCATTGGCTTCTGTAAC TTTTTTGTTGTACACATTACGTTCCGCTTGTCTCCGCGTGGCTAAACTCGCGTTATATTCTTTTTCTGCA AGTTGGCGTTGTTGAACGACTTGTTTGTAAGTTTCTTCCGCTTCTCTAAGTTTGTTATCAATTATATCTT TTGATTTATCTTGATAAGTGGCACTAATTGCAGCAGCAGCAGTAGTCGCCATGGCAACTGCCTTATTTGC ATTGTCAAGCTCTTGTTTCCTTATATTTTCTATAGCTTTTGCTGCTTCTATATTCGCTTGTTCTGCCTTT ATTTTAGCATCTGCTGCTGATTTGATTGCCATATCCGCGTCGCTCTGAGATATTTTACCAACGGCGGCGA GTTGGACACTTGTTGCTGAATTCGCTTCTAATTGTGCCTGTTCGGTTCGAGCTCGTAAAGCAACGTCGAG AAGGTCGTTTGCAACCTTTTTATCAACATTTTCTGTTAAAACTTGCGAGACTTCTTGTATAATTTGCCCA GATTCGCGTTTTTTATTGTTTACCTTCTTAACAATGAAAATTACCAACCCAACAACAGCTGCGACGACAA GTACAATTATAGTATATATTAACCAGCGGCTTATGCCACCGGACGCTTCGTTGCCGGTATTGACATTGTT TAACTCGTTCAGATTATTAATTTCAGTAATCACGTTTATAGGAACCTCAACATCGGCCAT