Monarch geneset OGS2.0

DPOGS208372
TranscriptDPOGS208372-TA5322 bp
ProteinDPOGS208372-PA1773 aa
Genomic positionDPSCF300146 + 50537-57561
RNAseq coverage1773x (Rank: top 7%)
Annotation
HeliconiusHMEL0072330.064.14% 
BombyxBGIBMGA012360-TA0.057.69% 
DrosophilaCG43078-PH2e-2627.23% 
EBI UniRef50UniRef50_D6WEE61e-3630.83%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WEE6_TRICA
NCBI RefSeqXP_972005.13e-3730.83%PREDICTED: similar to AGAP005739-PA [Tribolium castaneum]
NCBI nr blastpgi|2700039103e-3630.83%hypothetical protein TcasGA2_TC003200 [Tribolium castaneum]
NCBI nr blastxgi|2700039104e-10428.41%hypothetical protein TcasGA2_TC003200 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL20554 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208372-TA
ATGGAATCTCAATTGTCAGCAGATGAGAATAAAAATTCTCCATTGCCTATTATAACACATGAATGGGAGGATTTAAGAAAAGCTCGTGAGGCAGGCGGCTATCCTTGGACTCACCTTTTGAAAGCACCTTTAGAAGGAGAAATTACTGCAGAAGACATTATAAGATCTACTTCTCCAAGAAGAAGCATGTCTCGAGAATTTTCTAAATCTCGAACTCATTCACCAGTTGATGAAACAGTCCAAAAAATTCTTAATCTTGACTCATCACCAGGTAGTAGCCGAAAACATGGTGAAGAAGGTGAAGATGAAGGAGTGCATATACCAAATTTTTCAAAAGAAATTTCATTGGACTCAGACGAAGATGTTGTTAACCAAGCACTGGCTACAAAACAGTTAGAAAACCAAAATACTCAACATTTGATATCGCCACCAACGGAAATTCCAAAAAATGTCTCACCAAAAAGTATTCTTAAAAGAAGGACACCAGAACCACAATACTTTGGGAGTACTGAGACACGTGAAAAAACAAAATGTTGTCACCCGTTAGTTAATAAACTTAAACACTTAGCTGACAAAACGTTACACAAGTTAGAAAGAAACGATCATGAAAAATCTCCCCAAACAAAAAGAAAAAAAGCTGAAGAGCCTCAAGAAATAAGACAACTCAAAAACTCTCCTAGCGCTATTCGAAGACAAAAATTTAGTGCTATAAAATTGGGAGACTCAGATGAAATGAGCAGAAATATGAGCATAGATATCCCAACGCCTCTGAAAAAAAAAGAACACATATACGAAGATATAGAAGAATCAAAAGAGTTTGATAATGTAAAATCATTACAGGATCCTCCGGATATTGATAACGATGATTTAAAAACAAGTTCAAATAAAGAAAACGACAGTCTTCAAGATAAAAAGTCGAATACATCTCAGGATCCTAGTTTAATAATTGAAGACAATGCTTCACTAGACGCTAATGAACATGTGATGGAAAAACTTCACCAGAGAAGTAGTATTAGTAAAGAAGTTGTAATGTTAGACGAAGAAATAGTTCCAGCAGAGTCAAACACAGAACTAATGTACAAATCGGAAAACGATATTAGAAAAAACAATGAATCTCCAAATATTACCATTACAGAAATAAATGATGATGACATCACCATAGAACCGTCTTCTTCGATACCTATAGAATCATCTCCAAGTCCCAGTCCTCGTGTTATTGATAGTAAACAAGACTTCATGGATACAGACTGGTCTAGACCCAACAGGCCTACTAGAGAAGGCAGTGTTCAATTATATACACCAGAAGTACGTTGTGCATGTCAGTTTTTCCAATGTGATTCCCTATTACCGGAAAGAATAAATCCCGTGGGCACTAGACCAACTTCTCGAGCTAGTAATATTATTCAAAATATTAGTGATCACGAATACGAAATTATAGAAAAACCTCCCCCAAATATAATTTATACGGCACCAAGTATCGACAATAAACCCAACATGGTGGTAACGGACGACGATGAATTATCAAGTATCTCGTCACTGGAAAAAAAATACCTTCATCATACATCCGACGAGGATGAGCCAATGGAGCATAAGGAACAAATTAATTGCGAAAAAGATGAAAATATTTCAAACGCTGAAGCATTACAAAAAGGCATTGAAGAACGCTTCTTTGTAAATACTCCCTCAACGACATCTAGAACTGAATGTGAAGTAAGTACCAGTCAAGTAGATTCTTCGGCCCTGGAAAATTTACCCTTAAAGAGAAGTAGTAGAAATAAAAATTCTGAAGGAATTGAGTCGAAAATACAGCAGCGTATAAAAGAAGGCACGGGAAAAATAAAAAGTCAAGCTGGAAAATTAAAAACAAAATTAAATACTATAAAAAACAAACAAATGCATTTTCCTGAAAGATCAAAAATAAAATTGCAAGAAAAACCAAAGCCTTCTAACGCTGATCGGTCGAAATCAACATTACCAGAAAAACGAAAGTTTAGTTTGCCTGACAGACCAAAATTTAAAAAGATTAATTTTTCTGAAAAACTATCGTTTGGTGATCGAAAGAAATTCAGTTTTCCGGAACGTCCCAAATTTAATATGCCTGATCTACCTAAGTTTAAAATGCCTGAGAGACCTAAAATAAACTTCCCAAGTCTTGGTAGGAAAAAAGTTGATATAAATAAGTCTGAATCTTCATCCGATTTACAAAATGTCTCCGTAGAGTTTGAAGCTAAAACATATCCGAGACTTTTTAATAGAAAGAAACAATATATTCCAAAAACCTCATCTTCACCGACGCTTAACAGAGAAGATACCCCACCAGCAACATTTACTTTTACTAGAGTAAAAAAGGCAACGGATAACCAAGAACCGTCACGTCATATCCCAGATAGTCCCGAAGAACCCCGAGAATACGGTAGTTTAGACAAGGAGTGCGAATATACCGATGACTACAAAATAGAAAACAGACAAAACTTTTGTACCACATACGACTTTGATAAAGTAGATCAAGTAGATCAAGAATTTGCAGATGAAACAAATAGTCATGATGAGTTAAGAATACAAGTTTCCGAATCAGATATATCACCGCCACTACAAGAAAACACACATATAAACGAATTAAACAATGACGAATTTTTTGTACGACCGAGAGGTATTTCTCGTGAAAATATACAAGTTAGAGAATACTTAAGCGACGAAATACGACAAGCATTTAAAATTCCCAAAAACGTGCTCGCTGATATGTCCAACGAAGACCAATTTAATAACAAAAATATATATGCCAACGATCCAGAAGACCACGACATAGCATTAGATGGCCAGCCTATCAGTTATTCCACTGAAGATATAAATGATAGAGATGATGGTTATTACACATTCCCGCCAGTACGACCTTCAAGAGCAAAACGAAAGAAAAAGGATGCTGAATCAATTAAATATATAGACGACAGTATAAATGCAAGCATGCAATTTAGCGAAGTTGATTTAGATCTAAGACCTACTACTGATCTTCATTCTATCCATGAATACGCCAATGATGATGTTATCGAATATCCTGATAGTATGCCAATCCAATCTCAAACCTTACCTATGCCACCACGAAGGAAGAAAAAATCTTTAAAAACTGGTATCAGAAATACCTCTTTAAATGATGTGAACATGTACCCAGCAGAGAGATGGCAATCAAAACACGATGGATGCGATGATATTATCGTCTACAGAACAGAACACGAGTACATTGTTCCACAAGCTGATTTATCCCAAGGTGATGTAACAGAACAAAGTCCATTACCACCGAGAAGAAATCGATCCAGAAGTTCAAGGACCACTTCTGTATGTGATGATGACCGTACATCACATGGTGCGGAATCGCTTATTCTCGATGCTCATATTTCAACAGCTGACAACGCTTTATCTGAAAGTGATATTCAGCGTGAAAGTCCAGGTTATGCAACTGTTGACAAAGGTAATTTTACGCCAAGTAAAGGTGCTAGACGGTCATTAAGCAAAACTCCACCAGCAAGACGCCGTAAGAGTAATAGTTCTGAAAGAAAATATTACACAGTCTCTAGTCAGAAAAGCAGAATGCCAGATCGACCGCCGAGAAAAAAATCCTCTACGAGTCTTATGACCCTTGACAGCTTCACAAAAGAATCTATAAATGGTGATCAGACCCAATATGTTGAAATAGACAGATCGAAAATGGAAGATTCACACAAAGACCTAAAATCGGGGGCAATCGTCAGTAAAATGAAAGACAGACCATTACCTCCACCTCCTCGTCCTCCCAGAGGACCAAAGCGTAAAAAACTCTCGCAAGAGGAAGAAAGCCAAAAATCAGCATTAAATTTATCAGATTATCTTGATGTAGTAGAAATTGAAGTTTCGACACAAACAGATCCTTTGCCAGATGATGTTGACTTCGAATTTGGAATAGATGACAATCTTGATTTGTCTATGTCTAGTTCCCTTAGAGACATAATTGATGAAGAATCAATATTAGGAAAGATTCATGACAAGTCAATTACATTAGAAGCGGATCGCCGTAGTTCTAGGCCAGCATCACGGTCTGAGAAGTCTCTAAAGTTGTCGGATCCTAAATTAGGGGAATTTTCTAAATCAAGCCTCGGTAAGACATCCCCTACTGTTATATTAGTGGAAAAACGGGTATCTAGCCCAACGAGAATAGACGAAAAAGAGGTAATATTAACAGAAGCATCATTGACTGTACAGCCTATTGATATTGATGATTCACAAGTACCAGATGTTCCGCCTTTACCAAAATCTAGAGACACTCTAACTTCAACTATAAAACCTAGAACTGAGCCTGAGATTCCTGAAAGTGAGAAAACTCTTGATAAAATTGCTGAAACTAACAAAGATATTTTGTTAGATAACTTAGTAACACAAAGACTTCAAGTTCGAGATTTAGACGTTGGTCGATTAAATGTTTCAGAACTGCAAGCATCAAAAATTCTTGTTTCCGATATTGAAGGCATGACTTTAAATGTTAACGAGTTAGACTCTAAGTCTGGTCATATTTCAATAACTGGAATAGAGTTCTCTCAATCTGTAATCGATGAAATTGTTAAGAAATTTACCGAAATGTCAACTTCTATTGTTCCTAACACTCAGATAGTAGACACCCAAAATATTGAGAGGCCAATTAGTAGGGAAGAGGAAACGCAAACAGATACTCCTTTACCTGATAAAAAACAAGAAAATATTATTTCAGAAGAGATTAAAATTGACTCATTACCATCCTCAACCGCCAGAAGTAGTGAATATATTGAGGAAATAACTGTCCCCCCACAACGACCTCCGCCTCCTGATTTGACGCCTTTATTATATTCCTATCTGCAGGATCTAACGATTACGTCATCATTACCTCATCAACAACCAATACTGCGGGAGAGGCATTACAGTGACTTTCATGAACCACAACTTCCTTCACCACAGCCACCAACACGTCGAGCTAAAAGAAAACCACCCGTTTTGCACAGCGAATCAAGTTCGGATGACGTAAAACCTCGACCGTCGCCCAGAAGAATGCCACCTCCAGCCCGAACTCAAGAACCAACGATAACTGAAGCTGGTGTCCAATTCTTACGGGTATGTCAGAATTCAATAAGCAGAACGTTTAGAAATATTGTGAACACATTTACGTCTTACATAAGCGGAACTCAAAATAAACATGATATGCAAGTCGCCATGGTTATATTCCTCGTGTTAATAGCTGGTTTAATAATGTTCGGACTCAGCGATAGCCGTACGATTCATCACCATCATTGGGAATTTTTTAATCCACCAGATAATAAGCAATAA

Protein sequence:

>DPOGS208372-PA
MESQLSADENKNSPLPIITHEWEDLRKAREAGGYPWTHLLKAPLEGEITAEDIIRSTSPRRSMSREFSKSRTHSPVDETVQKILNLDSSPGSSRKHGEEGEDEGVHIPNFSKEISLDSDEDVVNQALATKQLENQNTQHLISPPTEIPKNVSPKSILKRRTPEPQYFGSTETREKTKCCHPLVNKLKHLADKTLHKLERNDHEKSPQTKRKKAEEPQEIRQLKNSPSAIRRQKFSAIKLGDSDEMSRNMSIDIPTPLKKKEHIYEDIEESKEFDNVKSLQDPPDIDNDDLKTSSNKENDSLQDKKSNTSQDPSLIIEDNASLDANEHVMEKLHQRSSISKEVVMLDEEIVPAESNTELMYKSENDIRKNNESPNITITEINDDDITIEPSSSIPIESSPSPSPRVIDSKQDFMDTDWSRPNRPTREGSVQLYTPEVRCACQFFQCDSLLPERINPVGTRPTSRASNIIQNISDHEYEIIEKPPPNIIYTAPSIDNKPNMVVTDDDELSSISSLEKKYLHHTSDEDEPMEHKEQINCEKDENISNAEALQKGIEERFFVNTPSTTSRTECEVSTSQVDSSALENLPLKRSSRNKNSEGIESKIQQRIKEGTGKIKSQAGKLKTKLNTIKNKQMHFPERSKIKLQEKPKPSNADRSKSTLPEKRKFSLPDRPKFKKINFSEKLSFGDRKKFSFPERPKFNMPDLPKFKMPERPKINFPSLGRKKVDINKSESSSDLQNVSVEFEAKTYPRLFNRKKQYIPKTSSSPTLNREDTPPATFTFTRVKKATDNQEPSRHIPDSPEEPREYGSLDKECEYTDDYKIENRQNFCTTYDFDKVDQVDQEFADETNSHDELRIQVSESDISPPLQENTHINELNNDEFFVRPRGISRENIQVREYLSDEIRQAFKIPKNVLADMSNEDQFNNKNIYANDPEDHDIALDGQPISYSTEDINDRDDGYYTFPPVRPSRAKRKKKDAESIKYIDDSINASMQFSEVDLDLRPTTDLHSIHEYANDDVIEYPDSMPIQSQTLPMPPRRKKKSLKTGIRNTSLNDVNMYPAERWQSKHDGCDDIIVYRTEHEYIVPQADLSQGDVTEQSPLPPRRNRSRSSRTTSVCDDDRTSHGAESLILDAHISTADNALSESDIQRESPGYATVDKGNFTPSKGARRSLSKTPPARRRKSNSSERKYYTVSSQKSRMPDRPPRKKSSTSLMTLDSFTKESINGDQTQYVEIDRSKMEDSHKDLKSGAIVSKMKDRPLPPPPRPPRGPKRKKLSQEEESQKSALNLSDYLDVVEIEVSTQTDPLPDDVDFEFGIDDNLDLSMSSSLRDIIDEESILGKIHDKSITLEADRRSSRPASRSEKSLKLSDPKLGEFSKSSLGKTSPTVILVEKRVSSPTRIDEKEVILTEASLTVQPIDIDDSQVPDVPPLPKSRDTLTSTIKPRTEPEIPESEKTLDKIAETNKDILLDNLVTQRLQVRDLDVGRLNVSELQASKILVSDIEGMTLNVNELDSKSGHISITGIEFSQSVIDEIVKKFTEMSTSIVPNTQIVDTQNIERPISREEETQTDTPLPDKKQENIISEEIKIDSLPSSTARSSEYIEEITVPPQRPPPPDLTPLLYSYLQDLTITSSLPHQQPILRERHYSDFHEPQLPSPQPPTRRAKRKPPVLHSESSSDDVKPRPSPRRMPPPARTQEPTITEAGVQFLRVCQNSISRTFRNIVNTFTSYISGTQNKHDMQVAMVIFLVLIAGLIMFGLSDSRTIHHHHWEFFNPPDNKQ-