Monarch geneset OGS2.0

DPOGS206108
TranscriptDPOGS206108-TA3771 bp
ProteinDPOGS206108-PA1256 aa
Genomic positionDPSCF300028 + 398641-406454
RNAseq coverage2229x (Rank: top 5%)
Annotation
HeliconiusHMEL0050380.091.29% 
BombyxBGIBMGA004315-TA1e-0925.10% 
DrosophilaDp1-PB0.052.87% 
EBI UniRef50UniRef50_Q5TRN40.053.19%AGAP005467-PA n=4 Tax=Endopterygota RepID=Q5TRN4_ANOGA
NCBI RefSeqXP_969652.10.061.43%PREDICTED: similar to high density lipoprotien binding protein / vigilin [Tribolium castaneum]
NCBI nr blastpgi|2700138320.061.58%hypothetical protein TcasGA2_TC012484 [Tribolium castaneum]
NCBI nr blastxgi|2700138320.061.46%hypothetical protein TcasGA2_TC012484 [Tribolium castaneum]
Group
Gene OntologyGO:00037231.3e-17RNA binding
KEGG pathway 
InterPro domain[575-643] IPR0040871.3e-17K Homology
[579-638] IPR0181112.5e-15K Homology, type 1, subgroup
Orthology groupMCL14662 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206108-TA
ATGATGCATCAGCAATCCATGATGGTTGGAGACATGATACCCGTACATTCTGAAATACCCGTGCATACAGAGGAGATGAATAATGTTGGCTACGAGTCCAATAGCAACTCGTATGCTTATGATGACCTGTTCCCGGCTTTGCCTCACTCGCAGGCTCCAGTCCAACGCAACCTTCAACCCATCAGCAACAAACTGAGGGTTGGATCCTCTCTTCATACTCAGGTTTTCCACGTGCCGTATGAAGAAAGAAAACTGGACAACGCCAATACTTTTGGCGAGGGCGAGTCGTTAAGGACATGTCAGTCTATTACTAAGGACACCGGCGCTCATATTGAAATATCAACTAGCAAGGATGGAAGCCTGACATTCCTTATTACCGGAAAACATAGTGCAGTCCTAGATGCAAGGCGTTTGATCCTTACCCACTTTCAACAACAGGCGAGCAAGCAAATTTCTATTCCCAAGGAGCATCATCGTTGGATCCTTGGAAAGGGGGGTCTAAAGTTAAAAGAGTTGGAGAAAATGACGGCTACTAAGATCAGTGTTCCTGGAATAGCAGATAATAGTGAAGTCATCACTATTACCGGAACCAAAGAGGGAATTGAAAAGGCTGAACATGAAATTCGTGTCTGTTCTGAGGAACAGTCCCGAAAAGCTCTGGAGCGTATAATAGTCCCGAAGATCTATCATCCCTTCATCCAAGGACCATTTGGTGAAACGGCGGAAGCTTTAAGTTCTGAGACCGGCGCTCGTATACATATTCCACCGGCTTCAACCAAAAGTAACGAGATCGTTATTGCTGGTGAAAAGAACGGTGTGCTGGCAGCCAAGGCTAGGATCGAGCAGATTTATGAGGAGATGGCAAAAAAATGTTCAACTGTACGTGTCGAAGTGCCTAAGTCGCAGCACAAATACGTTATTGGATCTCGCGGAACTACTATCCAAGAGATTTTGAAAGAAACGGGTGTTTCTGTGGAAATGCCACCTCCGGATTCACCCACGGGTACCATTACTTTACATGGTCCTCATAACAAGATTGGTCTTGCTCTATCAAAGGTGTGTGAGAAAGCAAACTCTGTGAAAACTGCAACCGTTGATGCACCTACCTGGATTCATAAGTACATAATTGGAAAGAATGGCTCTAATATTAAGAAGATTACTCAGGACTTCTCAAAGGTGCACGTAGACATTACACACTCTGAAGATAAAGTCAAAATTGATGGACCTCCAGAGGAAGTTGAACGCGTCCAAGTGGAATTGGATAACTTTGTGAAGAACTTGCTTGCTACACATACATATGTGGAGTTGACTGTTGACCCTAAATTCTTTAAGCATATCATTGGCAAAAACGGAAGTAACATTAATAGACTGAAGGTTGAGACTCGTGTAGTTATAAATATTATTGAGAGTGAAGGCAACAATGTTATACGCATCGAAGGCAGCCATCAAGGTGTCGACGACGCTGAAAGAGAACTGCGCGAAATGGTTATGAAGTTAGAGAACGAAAGGACGAAGGAAGTCTTTGTTGACCACAAATATATTAAATCATTAATAGGAGTTAGAGGTGACAGAATAAAGGAAATTCGTGAGAAATTCGACCGAGTACTCATATCACTACCGGATCAAGGTCAAAAGAGGAGTCCCATCAAACTCCGAGGACCGCAGGAGGATATAGAAAAATGTGAATCACACCTCCATAAACTGATGAAAGAAATTGCCGAATCGTCTTACATACAAGAAGTGCCTATCTTCAAACAATTCCATAAGTTCATTATCGGTAAGGGTGGTGCTAATTTAAGAAAGATAAGAGACGAAACACAAACGCAGATCGATCTGCCTGCTGAAGGGGACGACAGCGATGTTATTACAGTGAGAGGTAAACGTGAAAACGTAGAGGAGGCCGTTAAAAGAATACAACAAATCCACAACGAGAAGGCGAACATTGTCACAGAGGAGGTAACGATAGCGCCTAAATATTATAACTCACTGATTGGTGCTGGCGGTAAACTTATACATTCTATTATGGAAGAGTGTGGAGGTGTTCTAATAAAGTTCCCACCAGCCGAGAGTGATAGCGACAAGGTTGTGATAAGGGGACCGATCGAGGACGTGGAGAAAGCTAAGCAGCAGTTGTTAGCACATGCTTCGGAGCGCGAATTGACATCCCACACGGCCCACGTGCGAGCTAAACCAGAGCATCATAAGTTCCTCATTGGAAAGAATGGCGCTAACATCAAGAAGATCCGCGAGCAGACTGGCGCTCGTATCATTTTCCCTACTGAGAAGGATGAGGACAAAGAAGCCATTTTCATCATTGGTCGCGAGGCACAAGTGGAGGAGGCACGAAAGCAGTTGGAAGCCGCCGTTGCTGAAATCAGCAACGTGTCCGAGGGTGAGATGGCCGTGGACCCGCGCCACCATCGACACTTCGTGGCTCGACGTGGAGAAGTGCTGAGAAGGATCGCTGAAGACTGCGGGGGAGTCCAGATATCATTCCCACGACAGGGAGTCAACAGCGATCGCGTTGTTCTCAAGGGGCCTAAGGAATGCATTGAGGCTGCCAAGAATCGGATCACCGAGATCATTGAGGATCTGGAAGCGAAGGTTACCATTGAATGTATCATTCCACAAAGACATCACCGAACGGTGATGGGGGCGCGCGGTGCCAAGGTGAAGGACATTACAGCCGAATTTGATGTTCAAATCAAGTTCCCTGAGCGAGACCTCACTGAGGGTGCTGATATTCCACTAAGAAACGAAGATAACGCTGAACCAGGACAAAATGACATCATCAGAATAACTGGACGGCCGGAGAATTGTGAAGCGGCCAAAAAAGCTCTGCTGGACCAAGTTCCTATTACAATTGACGTTGAAGTGCCAAATGATCTTCACCGTTTGCTCGCCGGTCAAAAGAGGAGGGAATTGATGCAGACCTATGACGTTCACATTCTAATGCCACCACCTAATGAAGAAGCCTCTGATATTGTGAAGGTCACCGGTACACCTACAAATGTTGAGAAAGCAAAGGTGGCACTTGCTGAGAAGATTGTAGAGATGGAGAAAGAAAAAGAAGATAGGATTCTAAGATCGTTTGAGCTGAAATTCAAAGTGGACCCTGAATACCACCCTCTTGTTATTGGTAAAGGTGGCTCAGTGATTACTAAGATTCGCACAGATTACGGAGTACAAATAAATCTACCAAAGCGAGGTGAACCCGATGATGATATTATCACCATACAAGGATACGAAGATAAGGCACATCAAGCCAAAGAAGCCATTATGAATATAGTTCACCAACTTGATAACCAATATCGTGACGAGGTGGACATCGATCCCCGCGTCCATAGAAGACTGATAGGTCTACGAGGAAAGAACATAAGGCGCATTATGGACGAGTACAAAGTTGATATTCGTTTCCCGAAACAAGGAGACGACAGCATCGTTATAATAACCGGTGATGAAGACAACGTTCTCGACGCCAAAGACCACCTTCTCAATCTAGCCGAGGAATACTTGCAAGACGTAGTGGACCGCTACCAGAGGCCGGCCGGTCCATCTCTGGGCGATTTCGGGGACGTTCTCAACACTGAGAATACAAATAACGGCGGCGCTGCTGCCGTTCAGCCGTCCGGCGGGTTCGTGGTGAAGGGCGGGCCGTGGGAGCAGCGCGCCCCCGACACAGCCTCCACCCACGAGTTCCCAACAATGCCGGGAGCTCCACGAGCGGCGGCAAACCCCACACCCTCCTCCGCGTGGGGCCCTCGCCGCTAA

Protein sequence:

>DPOGS206108-PA
MMHQQSMMVGDMIPVHSEIPVHTEEMNNVGYESNSNSYAYDDLFPALPHSQAPVQRNLQPISNKLRVGSSLHTQVFHVPYEERKLDNANTFGEGESLRTCQSITKDTGAHIEISTSKDGSLTFLITGKHSAVLDARRLILTHFQQQASKQISIPKEHHRWILGKGGLKLKELEKMTATKISVPGIADNSEVITITGTKEGIEKAEHEIRVCSEEQSRKALERIIVPKIYHPFIQGPFGETAEALSSETGARIHIPPASTKSNEIVIAGEKNGVLAAKARIEQIYEEMAKKCSTVRVEVPKSQHKYVIGSRGTTIQEILKETGVSVEMPPPDSPTGTITLHGPHNKIGLALSKVCEKANSVKTATVDAPTWIHKYIIGKNGSNIKKITQDFSKVHVDITHSEDKVKIDGPPEEVERVQVELDNFVKNLLATHTYVELTVDPKFFKHIIGKNGSNINRLKVETRVVINIIESEGNNVIRIEGSHQGVDDAERELREMVMKLENERTKEVFVDHKYIKSLIGVRGDRIKEIREKFDRVLISLPDQGQKRSPIKLRGPQEDIEKCESHLHKLMKEIAESSYIQEVPIFKQFHKFIIGKGGANLRKIRDETQTQIDLPAEGDDSDVITVRGKRENVEEAVKRIQQIHNEKANIVTEEVTIAPKYYNSLIGAGGKLIHSIMEECGGVLIKFPPAESDSDKVVIRGPIEDVEKAKQQLLAHASERELTSHTAHVRAKPEHHKFLIGKNGANIKKIREQTGARIIFPTEKDEDKEAIFIIGREAQVEEARKQLEAAVAEISNVSEGEMAVDPRHHRHFVARRGEVLRRIAEDCGGVQISFPRQGVNSDRVVLKGPKECIEAAKNRITEIIEDLEAKVTIECIIPQRHHRTVMGARGAKVKDITAEFDVQIKFPERDLTEGADIPLRNEDNAEPGQNDIIRITGRPENCEAAKKALLDQVPITIDVEVPNDLHRLLAGQKRRELMQTYDVHILMPPPNEEASDIVKVTGTPTNVEKAKVALAEKIVEMEKEKEDRILRSFELKFKVDPEYHPLVIGKGGSVITKIRTDYGVQINLPKRGEPDDDIITIQGYEDKAHQAKEAIMNIVHQLDNQYRDEVDIDPRVHRRLIGLRGKNIRRIMDEYKVDIRFPKQGDDSIVIITGDEDNVLDAKDHLLNLAEEYLQDVVDRYQRPAGPSLGDFGDVLNTENTNNGGAAAVQPSGGFVVKGGPWEQRAPDTASTHEFPTMPGAPRAAANPTPSSAWGPRR-