Monarch geneset OGS2.0

DPOGS204156
TranscriptDPOGS204156-TA2103 bp
ProteinDPOGS204156-PA700 aa
Genomic positionDPSCF300034 - 589903-595816
RNAseq coverage240x (Rank: top 43%)
Annotation
HeliconiusHMEL0041224e-17866.38% 
BombyxBGIBMGA005038-TA1e-18062.84% 
DrosophilaCG7598-PA7e-7860.28% 
EBI UniRef50UniRef50_Q7Q4W43e-7761.54%AGAP000842-PA n=4 Tax=Endopterygota RepID=Q7Q4W4_ANOGA
NCBI RefSeqXP_975544.14e-7954.47%PREDICTED: similar to CG7598 CG7598-PA [Tribolium castaneum]
NCBI nr blastpgi|910872757e-7854.47%PREDICTED: similar to CG7598 CG7598-PA [Tribolium castaneum]
NCBI nr blastxgi|910872753e-7654.94%PREDICTED: similar to CG7598 CG7598-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[500-675] IPR0138572.6e-36NADH:ubiquinone oxidoreductase intermediate-associated protein 30
[494-676] IPR0089795.3e-33Galactose-binding domain-like
[302-490] IPR0077073.3e-20Transforming acidic coiled-coil
Orthology groupMCL24795 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204156-TA
ATGCATATAAGAAATGGTATGAAGGAACTTAAAACTGAATTTAAATTATTATCTCAAGAAATTAAGGAACATGTGTCCCATCCTGTACTTTTTGCAAGACCTGATGAAAGTTTTATTACTCCTGTGTCTTCGATGGGAGATCTCATGGCGATGTCACCAAGGAAGCCGAGGGAATTAGACCAACCTGATGTCACCAGTGACCTACTTACCGGTTCTGATTTTGATAGTATATGTGACGTTACTATGCAAGAGGTGCCTTTGTATGACGATAATTTGTTTATTGACGCAAACAGTTTAGATTATTTAGTAAAATGTGCAGAATCAAACCGAAACCACACTATGATAGATAGAGGTAAAGAAAGTTTATTTGTGAAATTTGATCCTTTATATGCAAAGCAGTTATTGAATAATTCAGAACCTTCACATGAAGTGTCACATTTGGAGTCCATAGATGGTGATATTGGTTATGAAACAGGAAGCAGCACATCAGCAATGGCTGATAATAATGTGGCTTCACCAAAACATACCCTGTCAGCAGGATCTATTGTATCAAAATGTACTAAAGATAAACCAACTCAAGTAGTTCCACCTGTAGTAAGTAATGAACTGTCTAAGGCCAGTGTGGCCCAAACAAGAACTACACCAGTTCTTCTTCGCAGTGTTTCAGCTATATTAGCTCCAACACAAGCTACAGAGAGACTTATAAACTTTTCCGATAACACCCCTATTGCTGCACCAAGAAGTCCACGTCAGTGTCGTCAATCAACTTATGCATCTCAGGAAGTGGATCGATTGCATTCGCTTAGAATCATATTACAAAACCAAGATCAGGAAGTATTACAATTAAGACAGGAGAACAGAGAACTCAAATCTTCACTTCAAGACCTTCAACATAACTGCACACGCACCAATGAGGAGTTAGAAAAAAAGGTTAAGAAACTCACCGATGAAAGGGACAGTCTATTGGAAAGAGAAATAAAATTAATTAAACAGGTTAATGACAAAACATTGAGTCATAAACAAATGTGCATAGTTATGGAGGAGTATGAAAAAACAATATCATCATTAATTGGTGAACAGCAAAAAGAAAAACTGTTTTCCCAAGAGTTGTATGACAAATTAACTTTGGAAAGAGACGAAGCTCTCAAACATCTGTCCAGTATGGAAAGCTCCTTTAATGATTTACTAGCAAAATATGAGAAATGTAAGAGTGTTATTATGGAGTCTAGAGACAAAGAAAAGATTCTTAATAACAAAATTAATGAATACGAATCAGGAATGCAAAAGTATGAAGCCTTATATAACAATCTTAAACAAGTAACCTCTGATAATCTTGAAAAGGCTAATGAAGAACTGGAAAATGTTAAAAAATCTCATAGTGTTGATATAACTAAGTTGCAAGAGTCTTTAGCTCAAAAGACAAGGGATAATGAGGAATTGACGCGTATCTGCGATCAGCTTATCAATGAAGTGCGCGAGACAGATTTATTATGGTGCTTTAACGAGACAAATGAATTAAATAAGTTTGTAACAACATGTGATAGTGACCATAATGAGGGTTACAGTTCCTGTGCCTTTGACATGAGCCCAGCTGGACGAGGATTGTTCCACGGATACCTCGACACCAGAACACCTAAAGATGGACGTATTAAAAAATCAGGTTATTGTGCCATTCGTTCAAAACGAGTCAGGAAAGCCTTCAAACGTGAAGCTACATATGACTGGAATCTGTATAACACGCTAGTGTTAAAAATAAGGGGCGACGGTCGGTCTTATTTGTTAAATATATCGTGCGAGGGCTATTACGATATAACGTGGAATGATATATACCATTACGTTTTATACACTAGAGGAGGACCCTACTGGCAAATAGCAAAGATTCCTTTTTCAAAGTTTATTTTAGGATCAAAAGGAAGATTACAGGATAAGCAAACAAGAATGAGGCTAGATAGAGTGACACATTTTGGAATATCATGTGGTGACAAAATAAATGGAGTTTTTAATCTCGAAATCGAATACGTTGGCTTAGAATTTGATCCCACGCACGATGAACAGTTTGCATACGAAATGTATAAAACCGATAGGTATATAGTCGGAGTTTAA

Protein sequence:

>DPOGS204156-PA
MHIRNGMKELKTEFKLLSQEIKEHVSHPVLFARPDESFITPVSSMGDLMAMSPRKPRELDQPDVTSDLLTGSDFDSICDVTMQEVPLYDDNLFIDANSLDYLVKCAESNRNHTMIDRGKESLFVKFDPLYAKQLLNNSEPSHEVSHLESIDGDIGYETGSSTSAMADNNVASPKHTLSAGSIVSKCTKDKPTQVVPPVVSNELSKASVAQTRTTPVLLRSVSAILAPTQATERLINFSDNTPIAAPRSPRQCRQSTYASQEVDRLHSLRIILQNQDQEVLQLRQENRELKSSLQDLQHNCTRTNEELEKKVKKLTDERDSLLEREIKLIKQVNDKTLSHKQMCIVMEEYEKTISSLIGEQQKEKLFSQELYDKLTLERDEALKHLSSMESSFNDLLAKYEKCKSVIMESRDKEKILNNKINEYESGMQKYEALYNNLKQVTSDNLEKANEELENVKKSHSVDITKLQESLAQKTRDNEELTRICDQLINEVRETDLLWCFNETNELNKFVTTCDSDHNEGYSSCAFDMSPAGRGLFHGYLDTRTPKDGRIKKSGYCAIRSKRVRKAFKREATYDWNLYNTLVLKIRGDGRSYLLNISCEGYYDITWNDIYHYVLYTRGGPYWQIAKIPFSKFILGSKGRLQDKQTRMRLDRVTHFGISCGDKINGVFNLEIEYVGLEFDPTHDEQFAYEMYKTDRYIVGV-