Monarch geneset OGS2.0

DPOGS215965
TranscriptDPOGS215965-TA2733 bp
ProteinDPOGS215965-PA910 aa
Genomic positionDPSCF300078 - 730857-735879
RNAseq coverage2311x (Rank: top 5%)
Annotation
HeliconiusHMEL0058870.080.29% 
BombyxBGIBMGA000528-TA1e-3632.27% 
DrosophilaNAT1-PC7e-9142.24% 
EBI UniRef50UniRef50_D2A1L43e-17642.09%Putative uncharacterized protein GLEAN_08408 n=2 Tax=Tribolium castaneum RepID=D2A1L4_TRICA
NCBI RefSeqXP_969772.20.043.18%PREDICTED: similar to eukaryotic translation initiation factor 4 gamma, 2 [Tribolium castaneum]
NCBI nr blastpgi|1892367942e-17943.18%PREDICTED: similar to eukaryotic translation initiation factor 4 gamma, 2 [Tribolium castaneum]
NCBI nr blastxgi|1892367942e-17444.30%PREDICTED: similar to eukaryotic translation initiation factor 4 gamma, 2 [Tribolium castaneum]
Group
Gene OntologyGO:00160701.7e-79RNA metabolic process
GO:00054888.4e-63binding
GO:00055153.8e-52protein binding
KEGG pathwaytca:6582773e-180 
 K03260 (eIF-4F, EIF4G)maps-> Viral myocarditis
InterPro domain[111-337] IPR0160211.7e-79MIF4-like, type 1/2/3
[108-339] IPR0160248.4e-63Armadillo-type fold
[120-335] IPR0038903.8e-52MIF4G-like, type 3
[817-904] IPR0033071.8e-18eIF4-gamma/eIF5/eIF2-epsilon
Orthology groupMCL11904 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215965-TA
ATGGCATTCCATAGTACGGAGCACGACCTTGCCGAGACATCGCTAAGTCCCCGGGCGGTGCGGCGACCCCTTGCCAATATCGAGCGACGCCTCATCTTCAACTCTTTAGAACACTTAAAAGTCAAAATTTCAGGCTCATTTATTTTGAGCGTAAATGCATCGAGTTGTGAGAGGCGAGTTGTGTGCGCGATCGCGGGCGTGTTCGCGTGTGCGTGCGGACGGTCGTGCGGCGGTGGTGTGTGTGTGAGGCGGCGACACCGGCCGCAAGAGGTCCCCCCGAGGCGGCGCTGGGTCCCGCCATCCACGCAGCGGCACCACGATGTGCCCGACGGCGAAGCCAAGCACGACATCATCCATAGGAAAGTGCGTGGCATACTTAACAAACTCACGCCGGAGAAATTCCAAAAGCTGAGCGATGATCTACTCGCATTGGAATTGGATTCGGACAAGGTGCTGAAGGGGGTGATCCTTCTAATATTCGACAAGGCACTTGATGAACCTAAGTACTCGTCGATGTACGCTCAGCTGTGCAAGCGGCTCAGCGAAGAGGCGCCCAACTTTGAGCCGCCCGGCTCCCCCTGTACTTTCAAGCTGCTCCTGCTCAACAAATGTCGCACCGAATTCGAAAACCGCGCCCAGGCATTCGCAGCCTTCGAAGACAAGGCTCTCACTCCTGAGGAAGAGGAGAAACGGCATTTAGCCAAGTGTAAAATGCTCGGCAACATCAAATTCATCGGCGAACTGGGCAAGTTGGAGATCCTCGCGGAGTCGATCCTTCACCGTTGCATCCAGAACCTGCTGGCTCGCCGCGCTGCCGCCGAGCACCATGAAGACCTGGAGTGTCTGGCGCAGCTGGTGCGGACATGCGGCCGCGTGCTCGACTCAGAGCGCGGCCGCGGTCTCATGGATCAATATTTCGCACGAATCGAAACGTTGTCCAACTCGCGCGACCTCGCCCCTCGCATCCGGTTCATGTTGCGGGACGTGGTGGAGCTGCGACGCTCAGGTTGGCTGCCACGCACGGCGGTGTCAGCCGAGGGCCCGGTGCCCATACACCAGTTGCGCGCCGACGACGAGCCACCGCCTCGCCGCGAGCGCGAGCGGGAGCGCGACTCTCTGTTCAGGGGCGGGATGCGCTCGCGGCCCCTGGACGACGTGCTGGCCGGTCTCAGCCTGCAGCCAGCCGCCGCGCTCGTGCCGCCGCCGGACAAATTGTTCGGGAACGGTTTCGCTCCGCCGGCTTTCCGCCAGCGCTCGGCGCCCGGCTACTACCCGCGCTCGCATTACAAGCACCAACAGCATCAGCACGCGCCCTCCGCGGGCAAGGAGGGCTCGCGCTCCGGCAAGGCTCGTGTGGCGGTGGCAGCGGGCGCTCTGCAAGACGTCCAGATGCGCCCCGCCGCCAACTCTCTCATGTTCACGGCCAACAGGCTGTCTCGCCCTCCGCCCTCGCAGCTGCCACTCGCCTCCCAAAACGTGCTGACGCCAACATTCGCCAGTGCTCCGCCTCTTATGAAAGAGCCCTCCATCACTATCAAGCCCGCGCCCGACAAAAAGGATAAACCGAAAAAGGACAAGGGGTTGAACAAGGAGGAGGCGTGTCGCCTGGGCATCGAAGCGGTGAGTGCGCTGGCTGACAGCGACGAACGCATCGACCAACCGGAGCCGGAGCCCGAGCAAGAGCACGAGCCGACGCTGGCGGCGCCTCTGCGGCGTCTGCACGACCTCCAGCTGCCCGACAAGTTGCTAAGGCGCGTCATCGCCGCGGTGCTGGAGCACGCGGTCACGGCAGACATACGACCGCCGGCTGACAACGACACCGAACCAGACGGTGACGCCAATGATGAAGACGAGCGTGTGACTCGTCTGCTCTGGGCCGCCTGTAGTGTGGTGCGCGCTGTTAAACGAGCGCCCCTGGGCGAGCCGCTCAGGGCGCTCCTGCAGGCGCACCACCCCCACCGCCGTCTACCCGGCCTTCTGGCACACGCCATCAGACAGAAGTTAATATCGCTGTCAGAAGTAGGCACGTGGTGCGAGGGTGGTCAGTATCACCCGTTATTACTGGAAGTGCTGCAGTCCCTACGAGAACTCGTCGGCCTGGAGCGCCTACAAGACATGCTCGAGGATAGTAAGGTGAACCTATGCGCCTATGTGTCGGAGCGGGAGGGTGGTGCGGGCGGTCTTGATGCTCTGGAGGCTCGCGGTCTCGGCGCGTTGGTGCCGCAGCTGCGGGTGCAGGCGGCGTTGGCGCGACAGCTGGCCACGGAGCCGGCTCCGACCGCGCTCTACCGCTGGATCAAGGCCAACGTGGAACCCTCCGTGAGACACAACGCCGCATTCGTGTCAGCGTTGGTGGCGCTGGTCGCGGAGCACGTGACGGCGGCGGCGGGCGGCGCCTCGCCGGACAAGGCTGCACTCGAACGTGAGAAAGCGCTGGCCGAGGCCTACGCTCCACTCCTGACAGCCCTGCTCGAGGGCCGCTCTGATCTACAGCTTGCGGCCGTATACGCCGTGCAGGTGCATGCACACCATCATCGGTACCCTAAAGGTATGTTGCTACGCTGGTTCATGTACCTGTATAACTTGGAGGTATGCGAGGAAGACGCTTTCCTGCGCTGGCGCGAGGACGTCACCGATGCCTACCCTGGTAAGGGAGAGGCTCTGTTCCAGGTGAATACGTGGCTGACATGGCTTCAGCAACAGGAGTCCGAGGACGAGGAGGCAGAGGACTGA

Protein sequence:

>DPOGS215965-PA
MAFHSTEHDLAETSLSPRAVRRPLANIERRLIFNSLEHLKVKISGSFILSVNASSCERRVVCAIAGVFACACGRSCGGGVCVRRRHRPQEVPPRRRWVPPSTQRHHDVPDGEAKHDIIHRKVRGILNKLTPEKFQKLSDDLLALELDSDKVLKGVILLIFDKALDEPKYSSMYAQLCKRLSEEAPNFEPPGSPCTFKLLLLNKCRTEFENRAQAFAAFEDKALTPEEEEKRHLAKCKMLGNIKFIGELGKLEILAESILHRCIQNLLARRAAAEHHEDLECLAQLVRTCGRVLDSERGRGLMDQYFARIETLSNSRDLAPRIRFMLRDVVELRRSGWLPRTAVSAEGPVPIHQLRADDEPPPRRERERERDSLFRGGMRSRPLDDVLAGLSLQPAAALVPPPDKLFGNGFAPPAFRQRSAPGYYPRSHYKHQQHQHAPSAGKEGSRSGKARVAVAAGALQDVQMRPAANSLMFTANRLSRPPPSQLPLASQNVLTPTFASAPPLMKEPSITIKPAPDKKDKPKKDKGLNKEEACRLGIEAVSALADSDERIDQPEPEPEQEHEPTLAAPLRRLHDLQLPDKLLRRVIAAVLEHAVTADIRPPADNDTEPDGDANDEDERVTRLLWAACSVVRAVKRAPLGEPLRALLQAHHPHRRLPGLLAHAIRQKLISLSEVGTWCEGGQYHPLLLEVLQSLRELVGLERLQDMLEDSKVNLCAYVSEREGGAGGLDALEARGLGALVPQLRVQAALARQLATEPAPTALYRWIKANVEPSVRHNAAFVSALVALVAEHVTAAAGGASPDKAALEREKALAEAYAPLLTALLEGRSDLQLAAVYAVQVHAHHHRYPKGMLLRWFMYLYNLEVCEEDAFLRWREDVTDAYPGKGEALFQVNTWLTWLQQQESEDEEAED-