Monarch geneset OGS2.0

DPOGS209799
TranscriptDPOGS209799-TA5628 bp
ProteinDPOGS209799-PA1875 aa
Genomic positionDPSCF300117 - 595053-614932
RNAseq coverage1329x (Rank: top 10%)
Annotation
HeliconiusHMEL0089850.081.65% 
BombyxBGIBMGA008023-TA0.081.27% 
Drosophilal(2)01289-PF0.075.86% 
EBI UniRef50UniRef50_E0VU050.068.45%Predicted protein n=3 Tax=Arthropoda RepID=E0VU05_PEDHC
NCBI RefSeqXP_002429599.10.068.45%predicted protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420182650.068.45%predicted protein [Pediculus humanus corporis]
NCBI nr blastxgi|2700017980.072.54%hypothetical protein TcasGA2_TC000684 [Tribolium castaneum]
Group
Gene OntologyGO:00454543.2e-06cell redox homeostasis
KEGG pathwaycqu:CpipJ_CPIJ0052193e-08 
 K09580 (PDIA1, P4HB)maps-> Protein processing in endoplasmic reticulum
InterPro domain[25-129] IPR0123365e-31Thioredoxin-like fold
[32-116] IPR0137663.2e-06Thioredoxin domain
Orthology groupMCL12504 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209799-TA
ATGGCTCGCTGGGTGCCCCTCGCGGCCCTGCTACTTCTCGTCACAGTCGACGCCGCGCGCAAACCGGCCCCGCCGCGCAGTGAGCCGCAGATCGAGGAGGTCACCTCCAAGCAGCTGGAACGGGTGCTGGAGGACAAGGACTACGTGGCCGTGTTCTGGTATGCGAGAAGCTGTGTCACATGTGACAAAGTTCTAGAAGAATTAGAGAAGATAGATGACGATACGGACACATTTGGCGTCGACTTTGTTAAAATAAACGACAAGAGGCTCGCGAAACAGTATGGCATTACGAAATTCCCCGCCCTCACGTATTTCCGTGAGAAGGAACCGATTATATACGAAGGAGATCTCATGGACGAGGAGAGCGTCTTGGATTTCCTGACGAGTTTAGAGGCAATGGACCTTCCAGACCGGATAGAAGAGGTTAATCAGAAAATTTTAGACAAAATTGTTGAAGACACAGAATACGTAGCCGTTCTCTTCTATAAGCCCGAATGCAAGAAATGTGCTAAAGCTCTTCAAGAGCTGGAGAACATTGACGATGAGGCCGATCAACTAGGTATTGGTTTCGTTAAGATCCATGATGAAGAATTAGCCGAAGAGTACAGTCTTGGAGACTTGCCAAGATTAGTCTACTACAGGCATCAAATACCTATTATATATGAAGGTGAATTGAGTAGGGAGGAAGATGTTTTGGAATGGCTCATTGCGAACAAGTCGACTGGGGACGAAGAGGACGTTATCGAGGATGTGACCGCTAAAACATTGAATACGCTTATAGGAAACGTTGACAACTTGGTTGTTCTATTTTACGATCATGGCGACGAAGATTCAATGACCGTGCTGGGAGAACTGGAAAAAATTGATGATGATTGTGACAGACATGGAATCCAATTCGTTAAAATCGATGATCCAAAAGCCGCGGTTGAATTCGGGATTGACGATCTTCCCTCTTTAGTGTACTTTGAAAAACAGATCCCCAATGTATATGATGGTGAACTAGAAAATGAAGAAGAAATTTTGGAGTGGTTAGTAGATCAGCTAGAAAAGGATGAAATTGAAGATGTTACGGATGAAATGCTCGATCGCCTTATCAAAGATGGAAAGACGGTAGCTGTACTCTTCTATGATAACAATGACCGAAAATCTCAAAAAGTCTTAAATGAGTTGGAGAATATTGATGACGAATGTGATCAACTAGGCATTGCTTTTGTTAAGATTGATAATGATGAAGAAGCTAAAGAATATGGCATTGAAAAAGTACCAACATTACTATATTTTGAAAAGGGAATACCAACATATTATGAAGGCAATTTAGAAGAAGAAGAAAAAGTTCTTGCCTGGCTAAAACATCAAACAGAAAGTGACGAAATTGAAGATATTACAGACGAAATGCTTGATCTCATCATCGAGAAAATGCCTTACGTCGCAGTTCTATTCTATGACAAGGACCATAAAAAGAGTCAAAAAATCTTGGCTGAGTTAGAAAATATTGATGACGAATGCGATCAGAATGATATTGCCTTCGTTAAAATCGATGACGATAAAGAAGCTAAAGAGTACGGTATTGAGACAATACCGACAATGGTGTTCTTTGAAAAAGGAATTCCTCACGTCTATGAAGGGGACCTCATGAAAGAAGAAGAGCTCTTAGGATGGTTAATCCATCAGAAGAGGCACAATGATAAAGAAGACAAACAGGACATCAGAATTCTCAATGAATTAGAAAACATTGATGATGAATTAGAAAAGGAAGGAATCGTCATTGTACGTATAGATAATGAAGCCGAAGCAAAAGAGTACGGAATTGACCATCTTCCCACTTTGGTATACTTTGAGGAAAATATTCCAGCTATTTACGAAGGAGATCTCATGAACGAAGATGAAGTTCTGGAATGGCTCATTGAACAAAAAAACAGCGCTACTATTGAAGAAGTTACCGATGAAATATTGACCGATCTTATCGAAGAACATGAATACGTAGTGGTTTACTTTAGTGGTAACTGTGAAGAAGGAGATGAATGTGATAATATATTGGAAGAATTGGAGAACATCGACGATGAGCTCGATGAAACCGGTATAATATTCGTCACGACAGAGGACATCACTCTAGCCAAGAAATATGGAATCAAAACCTTCCCCACACTTGTGTTCTTTAGAAATAAAGAGCCACTTATTTTCAAAGGTGACATCGAAGACGAAGATGAAGTCCTCGCGTGGTTAACTGACGAAAATACCCTCGAAATTCCCGGAAAAATTGAAGAAGTCAATGCTAAAATGCTAGAGAAGATTTTAGAGGAGAATGAACATATTGTAGTCTTCTTTTATAAGGAAAATGATAAAAAGTCCCAGAAAATATTGAGCGAACTGGAAAATATTGACGACGAATGTGAAGAACAGGATATCGACTTCGTTAAAACTTCCGACGAGGGTATCGATAAGGAATATGACCTCCCTGACTTACCGGCATTAGCTTTCTACAGACACAAATTCAGAACTATCTACGACGGAGACCTAATGCATGAAGAAGCCATTCTTAAATGGGTATTGGAACTTCATACTTCTCATCCTGACGTAATTGAAAATGTGGACAGAAAAACTTTGAAGGATCTCATTGACGATGTTGAACATTTAGCGGTATTTTTCTATAATGACAACTGTGACACGTGTGAGGAAATATTGGAAGAATTGGAAACGATTGACGATGACACCGACAAACATGGGATTCAATTCGTTAAATCTAAAGATTCAAAATTGGCATCTGACATTGGAATTTTCAGTTTTCCCGCTCTAGTTTACTACGAAACTGGAGTACCAATAATGTATGACGGTGACTTAAAGAACGAAAATAAGGTTCTCCAGTGGCTTATAGATCAAAAAAGTGAAGATAGCCACCAACAAAATAAACCTAATCCTAAATCCAAGGCCAATGCAGATGAACGAAAGTCGAAATTCAAAGGAAGAAACGCCGATCTAGCAAAAGAAGCCTCAGCGAAGAAATTAGGAAGTTCAAGGCTTAATCGAGAATTAGAAATTAGCAAAAAAAGCCACGAAGGCAAGTTGACACATACCGAAGATGACGATGACGACGATTTTGATAAAGATGACGATGATGGGCATTATAGTATCTTAGGCATTTTCAATAGGATTAAGAAAATAATAACAGGCGATCGCTGTTTCTACATTGGATTGGGGACGAAACCGGCTATTCCAAAAATCACATACGAACCCTACCAGTGCTGTCCCACTAAAGTTCAGAGCCCAACTAAAGTAGCGAAGGCGACCCCGACCAAGGTTCCAGCTAAGAAACTCGAAAAGGAAAGAAACCCCGGCCCTGAGAAACCAATTAAGGGCAAAAACAAGGCACTTGCTAAACCGGAAAAAGCAGGCAAAGGAAAAAAAGGAAACTTGTTGGATGAAACTGAAGTTTTAGATTGGATGGTCAAACAAAAAGAAGATGAAAGTATTGAAGAAATCGATAGAAATAGATTATCAAAGTATATAGAGGCAAAAGAATTTTTAGCAGTTGTCTTTTATAAAGAAGAGGATCCTCTTAGTCCAAGAATTTTAAGACATGTAGAATTGATAGACGATGAAGCCGCAGAATATGGTATAAAGATCGTGAAATGCAGTGATCGTCTTATGGCAAAGAAATATGGTTTTCGAAACCCCCCGGGAATAACGTACTTTAGAAAAACTAAATACATCAATTATGATGGGGACATGGATGATGAAGAAGAGATATTGGACTGGTTAACAAATCCCGAAAATATGGAGCTTACGGATCATATTGAAAAAGTTAATAAAAAAATGTTCCAAAAGATTAGACAAACGTCGGATTATGTAGCCGTATTCTTTTACAGTAATGACTGCAAACAATGTCCAAAAGTACTGTTAGAAATTGAGCACATTGATGATGATGCTGATGCTGCGGGCATAAATTTTGTCAAAATAAACGACTGGCAGATGGCGAAGGAGTTTGGAGTATTTGCATTGCCAGCCGTCTTGTTTTTCAAATTGGGTTCTAAAGATCCAGTCATATACGCTGGTGATCTCTATGATGGCCAGCAATTACTGAGTTGGCTGCTGACTCAGAAAAATCCAGCAGGAGACATAATAGAAGCATTAGAAGGACAAGAGTTACTGGATCTCATCAGTGACTCTGGTTCCTTAGCCGTGTACTTCTGGAATAGAACGTTATGTGAACTATGCAGTGTTAAATCCTCGCAACCAAAAAAGCCAAAAAAGAGTTTTAGAGAAAACGAGGACGAAGAATTACAGGAAATAGACTTTGATTCCCTCGATTGTGAGCAATGCTCTGGAATATTGGAAGAATTAGAGAATATTGACGATGACTGTGATAGACATGGAATTAAATTTGTCAAAACACAAGATTACTCGATATCTGAATCATATGGAGTGACCGATTTTCCTGTCCTTGTATACTTCGAGAATAATGTACCAAATGTTTATGAGGGTTCTTTAGCTGAAGAAGAAGAAGTGCTGCAATGGCTTATCACACAGAAGACAGAAGATCGTATTGAGTTAATTACTAGAGTAATGCTTGAGAATATGGTTGAGGAAACTCAGTACTTGGCAGTATACTTCTACAAACTAAACTGCCATATTTGTGAACACATTCTAGAAGAATTAGAAAAGATTGACGATGAATGTGACGTATATGGCATACATATGGTTAAAATTCAGGACCCTCAACTTGCGAAGAGATATTCAATTAAAACTTTTCCAGCTATGGTGTATTTCAGAAACGGAAATCCGCTCTTATTCGAAGGTGACCTACAAAATGAAGAATCTATTCTTGAATGGCTTGTGGATGATGATAATCGTGAACTTGCAGACGAAATTGAATCTGTCAATGACCGAATGCTTGAACGTTTGCTATACGAGTCCCATTTACTAGCTGTATTTTTCTATGATGAGGAAGATTGTCCAGAATGTCAAGACATTTTGGAGGCTTTGGAACAGATTGATGGAGAGGTTGACCAATACGGTATCGACTTTGTAAAAATCGCCAGCCCTGAGGCAGCGGCTGCCCACAATATAATAAATATACCCTCCTTGGTATATTTCCGAAAAAGAGTGCCAATGTTGTATGATGGTGACCTACATCAAGTGGATAGAGTGTTACAGTGGCTGACGTCTCAAGATGTTTTTGAAATAAAGAATGAAATCGAAGAAGTAAACAGGAAAATGCTTGATAAGTTATTGGAAGAAAACGAATTCCTCGCTGTTTATTTCTATGAAAACTCTGTGGAAAGCCGTATCGTTTTGGATAAGTTAGAAAACATTGATAGCGAGACAGACAATTTAGATATCACGTTTGTTAAAATGCATGATCCCCGGTATGCAAGAAAGTGGGGTGTCACTAAATTGCCCGCCATCGTGTACTTCAGAAAACGATTTCCAAGTATTTACCGAGGGGAAATTATGGCCGAAGAGGAAGTTCTTGAATGGTTGAGGAAGAATAGATTTAGGCAGCCGGAGCTTAACATATTCATGTACGCTTTGATAGCCTTATCAATAGCGTTTGTTATGTATACTGCATTTTTGCTGCAGTGTTTCAAACCTTCGCCTCAAACACAAACGCAGCATCCAAAACAAGCGTGA

Protein sequence:

>DPOGS209799-PA
MARWVPLAALLLLVTVDAARKPAPPRSEPQIEEVTSKQLERVLEDKDYVAVFWYARSCVTCDKVLEELEKIDDDTDTFGVDFVKINDKRLAKQYGITKFPALTYFREKEPIIYEGDLMDEESVLDFLTSLEAMDLPDRIEEVNQKILDKIVEDTEYVAVLFYKPECKKCAKALQELENIDDEADQLGIGFVKIHDEELAEEYSLGDLPRLVYYRHQIPIIYEGELSREEDVLEWLIANKSTGDEEDVIEDVTAKTLNTLIGNVDNLVVLFYDHGDEDSMTVLGELEKIDDDCDRHGIQFVKIDDPKAAVEFGIDDLPSLVYFEKQIPNVYDGELENEEEILEWLVDQLEKDEIEDVTDEMLDRLIKDGKTVAVLFYDNNDRKSQKVLNELENIDDECDQLGIAFVKIDNDEEAKEYGIEKVPTLLYFEKGIPTYYEGNLEEEEKVLAWLKHQTESDEIEDITDEMLDLIIEKMPYVAVLFYDKDHKKSQKILAELENIDDECDQNDIAFVKIDDDKEAKEYGIETIPTMVFFEKGIPHVYEGDLMKEEELLGWLIHQKRHNDKEDKQDIRILNELENIDDELEKEGIVIVRIDNEAEAKEYGIDHLPTLVYFEENIPAIYEGDLMNEDEVLEWLIEQKNSATIEEVTDEILTDLIEEHEYVVVYFSGNCEEGDECDNILEELENIDDELDETGIIFVTTEDITLAKKYGIKTFPTLVFFRNKEPLIFKGDIEDEDEVLAWLTDENTLEIPGKIEEVNAKMLEKILEENEHIVVFFYKENDKKSQKILSELENIDDECEEQDIDFVKTSDEGIDKEYDLPDLPALAFYRHKFRTIYDGDLMHEEAILKWVLELHTSHPDVIENVDRKTLKDLIDDVEHLAVFFYNDNCDTCEEILEELETIDDDTDKHGIQFVKSKDSKLASDIGIFSFPALVYYETGVPIMYDGDLKNENKVLQWLIDQKSEDSHQQNKPNPKSKANADERKSKFKGRNADLAKEASAKKLGSSRLNRELEISKKSHEGKLTHTEDDDDDDFDKDDDDGHYSILGIFNRIKKIITGDRCFYIGLGTKPAIPKITYEPYQCCPTKVQSPTKVAKATPTKVPAKKLEKERNPGPEKPIKGKNKALAKPEKAGKGKKGNLLDETEVLDWMVKQKEDESIEEIDRNRLSKYIEAKEFLAVVFYKEEDPLSPRILRHVELIDDEAAEYGIKIVKCSDRLMAKKYGFRNPPGITYFRKTKYINYDGDMDDEEEILDWLTNPENMELTDHIEKVNKKMFQKIRQTSDYVAVFFYSNDCKQCPKVLLEIEHIDDDADAAGINFVKINDWQMAKEFGVFALPAVLFFKLGSKDPVIYAGDLYDGQQLLSWLLTQKNPAGDIIEALEGQELLDLISDSGSLAVYFWNRTLCELCSVKSSQPKKPKKSFRENEDEELQEIDFDSLDCEQCSGILEELENIDDDCDRHGIKFVKTQDYSISESYGVTDFPVLVYFENNVPNVYEGSLAEEEEVLQWLITQKTEDRIELITRVMLENMVEETQYLAVYFYKLNCHICEHILEELEKIDDECDVYGIHMVKIQDPQLAKRYSIKTFPAMVYFRNGNPLLFEGDLQNEESILEWLVDDDNRELADEIESVNDRMLERLLYESHLLAVFFYDEEDCPECQDILEALEQIDGEVDQYGIDFVKIASPEAAAAHNIINIPSLVYFRKRVPMLYDGDLHQVDRVLQWLTSQDVFEIKNEIEEVNRKMLDKLLEENEFLAVYFYENSVESRIVLDKLENIDSETDNLDITFVKMHDPRYARKWGVTKLPAIVYFRKRFPSIYRGEIMAEEEVLEWLRKNRFRQPELNIFMYALIALSIAFVMYTAFLLQCFKPSPQTQTQHPKQA-