Monarch geneset OGS2.0

DPOGS206726
TranscriptDPOGS206726-TA2400 bp
ProteinDPOGS206726-PA799 aa
Genomic positionDPSCF300320 + 57990-66603
RNAseq coverage810x (Rank: top 16%)
Annotation
HeliconiusHMEL0120602e-15580.00% 
BombyxBGIBMGA002821-TA0.065.21% 
DrosophilaCG40178-PB0.040.51% 
EBI UniRef50UniRef50_F7ITY40.045.65%AGAP001070-PA n=20 Tax=Endopterygota RepID=F7ITY4_ANOGA
NCBI RefSeqXP_001655444.10.050.13%hypothetical protein AaeL_AAEL002502 [Aedes aegypti]
NCBI nr blastpgi|3784664210.069.70%DnaJ-25 [Bombyx mori]
NCBI nr blastxgi|3784664210.070.60%DnaJ-25 [Bombyx mori]
Group
Gene OntologyGO:00310721.4e-29heat shock protein binding
GO:00064571.3e-18protein folding
GO:00510821.3e-18unfolded protein binding
GO:00454548.9e-09cell redox homeostasis
KEGG pathway 
InterPro domain[16-104] IPR0016231.4e-29Heat shock protein DnaJ, N-terminal
[34-52] IPR0030951.3e-18Heat shock protein DnaJ
[131-250] IPR0123361.5e-16Thioredoxin-like fold
[143-234] IPR0137668.9e-09Thioredoxin domain
Orthology groupMCL14850 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206726-TA
ATGAGATGGAAATGCAAAAGCTCGAAGGGGCGATGGCTGTATGTGCTGTTGATGTTGATAGTGCTCCCCGTAGTTGTGGCTCAGAAGATCGGTGACCCTTACAAAATCCTCGGCATCAATCAACGAGCTACTCTACCCGAAATCCGCAAGGCTTATAGACAATTAGCTAAAGAATGGCATCCAGATAAGAATGAGAACCCGAACGCCGAAGCCAGGTTTGTTGAGATCAAGCAAGCATATGAACTGCTATCGGACACTGAACGGAGACAGGCATACGACTTGTACGGGATTACCAACGAAGATGACCACATGTACAAGCAACGGCACGATTACAGTCAATATGCAAGGTTTAGCAATGACCCGTTTGAGTTCTTCAGTACTCATTTCCGAGCGCAGGATCAGGACATAACATTGTTCCACAAACTCTCTGTAACAACGAGGCATTTCGAAAATAATATACTGGAGAAATCAGTACACACTCCAGCTTTAGTTCTGTTTTACACTGATTGGTGTTTCGACTGTGTGAGATCGGCTGCGTCCTGGAGGAAGTTGGTGGACTCCTTGCAGCCTTTGGGCGTGACCCTGGCTACTATACACGCTGGACACGAAGCAAGTCTGGCTAGAAGGATAGGAGTACATAGCGTACCCTGCCTCACGTTGATCTTAGACAAGCAGATTTACATTTACAAGGACGGGCTAAACTCATTACCAAAGATATTAGAATTCATGCGCTGGAAGTTCCCCTACAAGCTGGTCCGTGGTATAAACGATGGGAATGTGGATTCCTTTGTGACCGACTTCGAAGACAACAAAGTCAAAGCCTTGATTTTCGAGGAGCGTCAAACCATCAGGCTGAGATACTTGATCACAGCGTTCCATTACAGGGACAGACTGTCGTTTGCCTTCGTTGACATATCAGCTCGTGACACCGCGAACGTCACATCGAGGTATAAGGTCCAAAGGTCCATGGACACCATGGTCCTGCTGAAGGAAGACAGCATAGAACCCGCCGCCACCGTCAGCACCACAGAAATACAAACACAGACTATGAGGCAGTTGATAGAGGCTAATCAGATGCTGACGCTACCCAGGCTATCCTCACAGAACATTCTAGACACAGTGTGTCCCGTGGAGTGGCGTGCCGCCCGTCGCGTGCTCTGCTGCGTGCTTGTCGTGCGGGATGAGAGGGATGTCCGGTCTAACGCCCACAGTATTCAACAGTTACGTGACCTGGCCCGACGCGCTCCCGACCGTATACGGTACACATACGTGTACGAACACGCGCAGCCTGACTTCGTTAACGCGCTCGCCAACGGGTCCGGTATAGATCTTTCAAGCCTGGATCATCGTATCGTGGTGATATGGCGTCGGGAGTCCACCAGGATACAGTACGAGTGGCTGAAGGAGAGCTGGCCGAGCTGCGGTCGCTGCCAGGGCGAGGAGGGGGTCAGCTACCAGGATAAGATGAACCGCACCCAGCGGGCGCTCGACGAGATGCTCAAGAGACTGCTGAGGCCCAGTGAGGTGGTCGCCTACGAGGCTAGGATTCAGGAGCTGGTGGATGAGTCGTCTCCTTGTGGCGCTCGTCTGGTGATGGCTCGCGTCTCCGAATGGATCGAGCGCGCGATGTCCGCTCTCAGGTCACACCACGCTCTGTCGGCGCTCTCAATACTGGCTACTGTGGCGTTGGTGCTCGCTGCCGGGTACTTCATGGCATACCTCATACGAGTAGAAGAAGAATCGGTACAACGCGAGAAGGAAGAGAGAAGGAGACAAAATGGCGGCAAACGGAACAACAACGAGGCGCAGCCGGAAATGAGATTACACGAGCTACGAGCGGAGAAATATAATGGCTTGGTCAGGTTACAGAAGCCGGGTTGCAGGACCATAGTGCTGCTGGTAGACTCTCAGAGTCGGGTTCAACTGCTGTCTAAGTTCCACAGGATCGTATGGCCGTATCGCAAGAACAAAACCCTGGTGTTCGCGTACCTCTGTGTGGAGAGAAACGTGGAGTGGTTCCGGCGAGTGCTCCAGCTGTCCCTGGGCGGCGGCGGGGAACTGCGCGTCAACAGGAGGAACTGTGTGGGCACTGTGCTGGCTCTGAACCCGCACAGGAAGTACTTCTGCATCTACCACGCCAAACATCCCGAGTGCGTCAAACCGCACAAGCGTATGAGTCGGATGGCGGCTTCCCTGGGCGGGCGGGCGCCGGACCCTGAAGCCGGCGCCTTCATCGGCTTCACCACCGATCCTGATTCCTCCGACGACGACTGTTACGACCCACCCTTACTACTGCAAGAAAATCTTCTCGATGGACTGGAAAATTGGCTGGACAGGCTGTTCGAAGGCAGCACCCACCGCTATTACGTCAACTATTGGCCGGATATGACGACTAAGTGA

Protein sequence:

>DPOGS206726-PA
MRWKCKSSKGRWLYVLLMLIVLPVVVAQKIGDPYKILGINQRATLPEIRKAYRQLAKEWHPDKNENPNAEARFVEIKQAYELLSDTERRQAYDLYGITNEDDHMYKQRHDYSQYARFSNDPFEFFSTHFRAQDQDITLFHKLSVTTRHFENNILEKSVHTPALVLFYTDWCFDCVRSAASWRKLVDSLQPLGVTLATIHAGHEASLARRIGVHSVPCLTLILDKQIYIYKDGLNSLPKILEFMRWKFPYKLVRGINDGNVDSFVTDFEDNKVKALIFEERQTIRLRYLITAFHYRDRLSFAFVDISARDTANVTSRYKVQRSMDTMVLLKEDSIEPAATVSTTEIQTQTMRQLIEANQMLTLPRLSSQNILDTVCPVEWRAARRVLCCVLVVRDERDVRSNAHSIQQLRDLARRAPDRIRYTYVYEHAQPDFVNALANGSGIDLSSLDHRIVVIWRRESTRIQYEWLKESWPSCGRCQGEEGVSYQDKMNRTQRALDEMLKRLLRPSEVVAYEARIQELVDESSPCGARLVMARVSEWIERAMSALRSHHALSALSILATVALVLAAGYFMAYLIRVEEESVQREKEERRRQNGGKRNNNEAQPEMRLHELRAEKYNGLVRLQKPGCRTIVLLVDSQSRVQLLSKFHRIVWPYRKNKTLVFAYLCVERNVEWFRRVLQLSLGGGGELRVNRRNCVGTVLALNPHRKYFCIYHAKHPECVKPHKRMSRMAASLGGRAPDPEAGAFIGFTTDPDSSDDDCYDPPLLLQENLLDGLENWLDRLFEGSTHRYYVNYWPDMTTK-