Monarch geneset OGS2.0

DPOGS213271
Transcript: DPOGS213271-TA, 3114 bp
Protein: DPOGS213271-PA, 1037 aa
Genomic position: DPSCF300264 - 24802-35424
RNAseq coverage: 59x (Rank: top 68%)
Annotation
Heliconius: HMEL016663, 0.0, 70.42%
Bombyx: BGIBMGA001187-TA, 0.0, 68.95%
Drosophila: Spt20-PA, 6e-108, 41.59%
EBI UniRef50: UniRef50_Q7PUY9, 1e-128, 47.56%, AGAP012403-PA (Fragment) n=1 Tax=Anopheles gambiae RepID=Q7PUY9_ANOGA
NCBI RefSeq: XP_320152.4, 3e-129, 47.56%, AGAP012403-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158300154, 5e-128, 47.56%, AGAP012403-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|383860817, 1e-132, 36.39%, PREDICTED: uncharacterized protein LOC100879308 [Megachile rotundata]
Group
KEGG pathway 
InterPro domain: [99-272] IPR021950, 4e-28, Spt20 family
Orthology group: MCL17114, Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213271-TA
ATGGATGGTTTAATTCACGCGGCATTGGAAGCAGAGGTAATACTTAACCGGGCTAAACATGTGAATACAAAGCTAACAAATTTTGACAGTAGTGTCTCAGATCATAAAATGACTTGGACCCAAGAAAAGATGCACCTAGCGGAAACTGTTGATGAATCAAGGATGAAGTTTCAGAAAAATTCTGTCAGTGGTTCAGCAAAAACTGCAGAAAAATTTGATTTATTTAAGAAATTACATGAATTGTACAACGAGTTAAGTAGAGATGAAACTTCACAAGCAAACTATCAAGGGCTAAAGACGACATCATATTTATTGGAAAAGTTGTTAGCAACCTACAACCTTAATACATTAATAATTAATTTATATCCTGGTAACAAGGGTTACTCTCTGTCCCTTAAGATAAATGGAACTGCTCAGAACATAAACCCACCGGATGCAAATGCATCATCAAGTCAAGAAGAAACTTTAATAGAAACTCCTCGTTGGCCATATGAAGAAGAAGAGTTATTAAGTTATATAGATAATGAAGAATTGCCCGTAGTCTTGTTAGATCTCCTTGAGTCCGAACACTCGTGTTTATTCTACTCTGGCTGCATCATAGCTCAGATAAGAGATTATAGGCAAGCATATCCAAACTTTGTCTGTGATACACACCATGTGCTCTTAAGGCCCACAAACCAGAGTATAATAACGGATGCGATGTGTATCGGTCGAAGCGGTTGGGCGGGTGAAGAGCGGGGGGCTTTGGAGGCTGTCGAGGCGGCGTTAGTGCACGCGGCCGCGCCCCCGCTGTGTCTAGAGCCTCGGCCGGCGGTCGGTCTGCTAGCGGCTAGGCTTCATGCTGCCCCAAGACTGTTTAATACACCGAGGATACGACGTCAGGCTCGGAGATTTTCACAGGTGTCGGTTAATAGAAAAAGGAAATTGGACCAGTTCACTCATTATCATGGTCTGGAGTTGTTGGAGTTAATACACCGTCAGAGAGCGAAAAACAGCCGCCAAACTGTTCCACACACACGGTTAACATCGAAATTCCCAAAGAAACCACCGGAGGTGTTCAAACCTATAGAACCTCCAAAAATGGATCCGTTGCCGCTCGCACTACCGTCTGAACCGAACGCCCCGTTACGGTTGGCCCGCGCCTATGAGCGTCCACGCCCCACACCGGACTGCCAGCCGCAGTTGGTGGAAGAGTACATCCTGGAAACTGAGAAGAGCTCCCCGCACGCCGGAGCTGGTTTCTTCCACATCAAACTGTCTATACTACAGAGGCCATCCGACCAAGAGTTCCTTGGTGAACTGTATGTTGATAGAGATCACGTGGAAGGTGAAAGAAATGGAGCAGCCTGTAGATTCTCATTAGGTTCGCGACTTCAAGCCAACAAATACATACAACAGTTCACAGAAATTTTCACAGAGGAAGGTAGAAAATCTGTTCGGATAAAGCATATTGTGCCCGGACAGTTACCGAGAGTTTCCTTCACAGGAGGCATGAGAGATATGCAAAGAACAGCACAAGCGAACAACTCCACAGTTCAGACTCATGCCACAACCGTGCCTGTTGTAACCTCCGCCATAGCTGCTACACCCAATGCCCATTCAAACGCAAGACAGCTGCCTATACTGCAGGCACAATTGCAACAAGTTGGCAATGTTAACGTGACAGCCGTGGGCACTGTTGTGGGCAGTGTGGGCACGGTGGGCAACGTGGGCACAGTGGGCAGTGTGGGCAACGTCGGCACCGTGGGCACCGTTGGCAACGTGGTAAATGTTGGCCCTGCGACCGGTGTAACAGAAGCATTGAAACAGCAACCATCTCCAACAACACCCAGGCTTTCGCCACAGGCGTCAACGAATCAATTGCTAGCACAACAGCTCACTAATCCGCCGCAACCTCTCAACCCTCAGAAGATGCAATCAGCCATCATACACATACAGCATCCCTTGATGTCGTCTTCGGGAACATCACAGGTTCAGAGTATACAGTATACAAATACAAC
GACCAATCAGCAGAAGACGACTATAACTAAAGCGAGATCGACGAACCCAGCGATCAACGCGCTCGTTACTAGTCTTATGAATTCAGCGCAACAGTTTCAGCAAGCGGCAAGTCAAAATGCGGCTAAAGCGGTGGTGAGCACTTCAAGCAGTAACGCTACCATCCTGAATCTATTGAACAGCGCACCGGCTGCCATGACTCACGTCACCACCAGCGATAGCGACACACACAAGCTTCTGACCCGAACTGTTTCTATAGCGGGGGCTAGACTCATAGCTTCGACGAGCAGTCATACATTACCTACATATACACAACAGGTAATGACTGGTTACACACGTGAGAACGAGTCAACGAACGTGTCTAGTAGTGAGAGTGCTTTGCTAGAGAGGTTGATGGGTCCCGAGCCGTCCTCAACACCGCCGTCTCAGACGCCACAGACGGCACAACCACAGCCACAACCACAACCACAACCTGTCTGTCATTTGCAGGTAATGACTGGTTACACACGTGAGAACGAGTCAACGAACGTGTCTAGTAGTGAGAGTGCTTTGCTAGAGAGGTTGATGGGTCCCGAGCCGTCCTCAACACCGCCGTCTCAGACGCCACAGACGGCACAACCACAGCCACAACCACAACCACAACCTGTCTGCCATTTGCAGGGTCTAAGTTTAACATCGCTTCAGGGCCTACAGAGTATCCAGGGGTTGCAAAACGTTCAAGTCCAGATACCTGGTCTATCTGCGCCTATATCATTGTCACTGAACGTGTCAGGTGCACCCAGCGGGTTGTTGGTCTCAGTGCCGCCTACTACTTCTGTGGTACTCACAAATCAGCCTTCAGTGTTGTCATTGCCTATAGCTCAACTGATGTCCGGCGGTGTGAAGGGCGGCGTCCGCAGTGGATCGGTTCAGGTGGTCCGAGCGCCGCGACCGGCGAGACTAGTCCGACCCACCCGACCGTCGCTGCCAAATATTACAAACATCACCAATATCACTAACATGACGAACATACCATCCACTCCGGGCACAACTCAGTTTATAGCCCAATCGCAGGGACAGAGTCAAGTGTTGAACGCCCACCAGGTGCGAAGAAAATCAAACCCTGACAGTTCATAG

Protein sequence:

>DPOGS213271-PA
MDGLIHAALEAEVILNRAKHVNTKLTNFDSSVSDHKMTWTQEKMHLAETVDESRMKFQKNSVSGSAKTAEKFDLFKKLHELYNELSRDETSQANYQGLKTTSYLLEKLLATYNLNTLIINLYPGNKGYSLSLKINGTAQNINPPDANASSSQEETLIETPRWPYEEEELLSYIDNEELPVVLLDLLESEHSCLFYSGCIIAQIRDYRQAYPNFVCDTHHVLLRPTNQSIITDAMCIGRSGWAGEERGALEAVEAALVHAAAPPLCLEPRPAVGLLAARLHAAPRLFNTPRIRRQARRFSQVSVNRKRKLDQFTHYHGLELLELIHRQRAKNSRQTVPHTRLTSKFPKKPPEVFKPIEPPKMDPLPLALPSEPNAPLRLARAYERPRPTPDCQPQLVEEYILETEKSSPHAGAGFFHIKLSILQRPSDQEFLGELYVDRDHVEGERNGAACRFSLGSRLQANKYIQQFTEIFTEEGRKSVRIKHIVPGQLPRVSFTGGMRDMQRTAQANNSTVQTHATTVPVVTSAIAATPNAHSNARQLPILQAQLQQVGNVNVTAVGTVVGSVGTVGNVGTVGSVGNVGTVGTVGNVVNVGPATGVTEALKQQPSPTTPRLSPQASTNQLLAQQLTNPPQPLNPQKMQSAIIHIQHPLMSSSGTSQVQSIQYTNTTTNQQKTTITKARSTNPAINALVTSLMNSAQQFQQAASQNAAKAVVSTSSSNATILNLLNSAPAAMTHVTTSDSDTHKLLTRTVSIAGARLIASTSSHTLPTYTQQVMTGYTRENESTNVSSSESALLERLMGPEPSSTPPSQTPQTAQPQPQPQPQPVCHLQVMTGYTRENESTNVSSSESALLERLMGPEPSSTPPSQTPQTAQPQPQPQPQPVCHLQGLSLTSLQGLQSIQGLQNVQVQIPGLSAPISLSLNVSGAPSGLLVSVPPTTSVVLTNQPSVLSLPIAQLMSGGVKGGVRSGSVQVVRAPRPARLVRPTRPSLPNITNITNITNMTNIPSTPGTTQFIAQSQGQSQVLNAHQVRRKSNPDSS-
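
Consistency check (sketch):

The lengths reported in this record can be cross-checked with a few lines of Python. This is a minimal sketch and not part of the OGS2.0 record. It assumes that the transcript contains coding sequence only (it starts with ATG and ends with the stop codon TAG), that the trailing "-" in the protein sequence marks the stop, and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """True if a CDS-only transcript has the length implied by the protein.

    Each amino acid uses one codon (3 bp), plus one codon for the stop.
    """
    return transcript_bp == (protein_aa + 1) * 3


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3114
protein_aa = 1037
span = genomic_span(24802, 35424)

print(cds_length_matches(transcript_bp, protein_aa))
print(span, span - transcript_bp)
```

Under these assumptions, (1037 + 1) x 3 = 3114, which agrees with the reported transcript length. The genomic span is considerably longer than the transcript, which would be consistent with the gene model containing introns.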