Monarch geneset OGS2.0

DPOGS207001
TranscriptDPOGS207001-TA3246 bp
ProteinDPOGS207001-PA1081 aa
Genomic positionDPSCF300001 + 1002045-1014361
RNAseq coverage1154x (Rank: top 11%)
Annotation
HeliconiusHMEL0086730.087.40% 
BombyxBGIBMGA012921-TA0.069.00% 
DrosophilaMhcl-PB8e-14848.17% 
EBI UniRef50UniRef50_B4GMK89e-17444.93%GL12404 n=8 Tax=Eukaryota RepID=B4GMK8_DROPE
NCBI RefSeqXP_002000205.10.044.98%GI10100 [Drosophila mojavensis]
NCBI nr blastpgi|3227923250.049.49%hypothetical protein SINV_06162 [Solenopsis invicta]
NCBI nr blastxgi|3227923250.050.11%hypothetical protein SINV_06162 [Solenopsis invicta]
Group
KEGG pathway 
Orthology groupMCL10340 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207001-TA
ATGCCTCACGTCAGTTCTCCTTCTCGTACTTTGGAAGGGAAGACCGGTAAGCCGATAAGCATTAAGATACCGGAACAGAGAACAAACGGCTCCAATGGTATACTTTCACCGAATTCAAATGGCCCTCTGTCGCCCCCGATACTGGATGCCACACCTAAGAAGCCACCGAAAGCCCCCCTGGGACGAGCTATACTGCCGGTGAAAAAATCTAAGACGCTTCCAATTAATTTTGAAGAAAAAGTCCCTCCTAAGCTATCTTCGGGATTTTTAAGTGGTAAGAAATTTGTTGTTACTAAACCACTCAAAGAGGTCAAAACAATACCACAACAAACGGAGAACGGAAACTATTTATCAACTCGATCTCTGAGTCCTGCTTGTAAGAGAAGTACATCGGTTACAATAGAAACAGGTTCGATCGCGGAGCTTGCTCGGAGACATGGCTCCAACAATATCCTTACGTCGCGATCACCAAGTCCCTCTTCCTACGGAAGTGTCAAAAGTTCTGGAAGCTACTACTCGACATGTAGCAGTAACAGCACATTCGTTCCATTATCCCGTTCATCGAGCTTAACTTCTACGCTCTATAACAGAACTCCTACTCCGCGAAGAATGTACCCACAGACCTCTGACAACTCGGATAATGTCGACCTCAAAGAACTGACGTCTAAAGAGCAAGCGCCCGTCGTCTTCGATGCGAATCTATCATTTGTGCTTGGTTGTAAGAAGCAGCCTGTACGACAAAAGTTCAAGCCGATAGCTGCTCATCTTGAAGAGGGAACCTCGTCCAATTACTTGAGTACTAAAATAGACAACTTTCTGAAAAGAACCGATCATGTTATGGATGAGTGGAGAAGAATGGGCCACAAAGATGAACCGGATCTTGAAATGTATTATGATGGAAAAAAGAGAAAAATTGGTAGATCAAAATCTGCTACAAATATTATGATAAAAGGGTTCACTCTGTTCAGTCGCAGCGGAAGCCGAGCTAGTAGCGTCTGTAGGTCCTCGAGGGGCATATCTGAAGATCGTACCACAGTTTCTGAAATGGATGAGTTATCAGAGGTTGGCGGTGAGTTGTCCGATGAGCGTGCGGCAGCAGCTCTAGCGACCGAACGAGCTGATGCAGAGGCTGCGGAGAGACTCCGGATTGAGAGAGACAATAGAGAATTGCTGGCTAACAACCAACGTCTACAACAGGCATCCGAACGCTTGGAGCTGGAACTCCTTCATTGGAGATCTGCGGAGAATGGTAACGCTGAGCCGTCAGACTCCGAGGGTTCGGAAGCGGACGGAAGTGCCAGCGGCGATAAGTACAAGAGGCGGTTCGAGAGGGCACACAGAGAGCTGCAGCTGCTCAGGGCCCAGCTGAGGAGACAGCACGAGGACGACCTGGAACAGCTGGTGACGGTCAAGAAACAGCTGGAAAAGAAGGTTCAAGATGCCTACGAGGAGGTCGAGGAACAGCGTGCGGTCGCTGCACAATGGAAACGCAAACTGCAAAAGCTCACCAACGATATGGCTGATCTGAGATCATTGCTTGACGAACAAACGTGTCGTAACAATCTCCTGGAGAAACGTCAGCGGAAGTTTGACGCGGAGCTCCACAGTGCCCAGGAGGAGTTGAAGAGAGAGAGAGCGGCCAAGGAGAGACTCTCCCGGGAGAGAGATCAGGCACACGCGGAGAAATACGCACTGGAACAGAGTCTATCAGAGGCTCGCATGGAGGTGGAACTTAAGGAGGAGCGTCTTCTGTCAGCGGCGAGGGAGTTGGAGGAGAGGGGAGGAGGGGACGAGGTGGCCGCTCTCAGGAGAACACGCGGCGAGCTGGAGAGACGAGTGAGAGACCAGGAGGAGGAGCTGGACGAGCTGGCCGGACAGATACAGTTATTGGAGAGTTCTAAGCTTCGCCTGGAGATGCTTTTGGAGCAGCAGCGGAAGGAGGCTCGCCTGGAAGCTGCCGCTAGAGACGACGAGATGGAAGAAACTAGAGCGAACGCCTCTAAGAAACTTAAAATGCTGGAGAGTCAGCTGGAGAGCGAGCATTCAGAGCGGTCGCTGCTCCTCCGAGAGCGACACGAGCTGGAGAGACGGCTGGCAGCGCTGGAGGAAGCCGCTCGCCAGGAGACACATGAACAGGGACAGCTGGTCATTAGACTTAAGAGGGATGTGAAACGTTACCGCGCCCTGCTGCGAGACGCTCAGACGATGTTGGAACAGAAGGAAAAGGAAGGTGGAGGGAAGACTCAGATAAGACATTTGAAAAACCAGGTGGAAGACTTGGAGTTGTCGCTCCGCGCTGCTAACAAGGCACGCTCGACAGCTGAGAGTGAGGCGAGCGAGGCGACCGCAGCCCTAGAGGAGACCGCGCGCGCCCGAAACGAGGCCGTGGAGAGAGCGCATGCCGCCACGAGGGACGCCGCCGCCGCACGCGCAGCCCTAGACGACGCTGAGGAGGAAGCCGCCGAGCTGCTGAAGAAGTATCGGGCGAGTACGAGCGCGCTGTGCGCCGCTCAGGCAGCCGCCCGCGAGGCTGAGTCCCGAGCGGAGGCTGCGGCTGAAGAAGCACGTCAGGCGCGAGAGAAACTCACTGAGATGACCACTAGACTGGCACACGCTGAGGCCGGTCACACACACGAACAACACGAAGCCGGCAGACGACTCGAACTCAGGAATAAGGAATTGGAATCGAGTCTGGAGTTAGAAGCGACGTCCCGGGCCCGTCTTGAGGGACAGTTGGCTAGACTGAGGGACTCTCACGAACAACTCGCCAGTGAACTTAGCGCAGCACGCGCCAAAGATCACCAGGCCGCGGAGGAAGTTAGAAAACTTACAAGACAACTGAGGGAATTGAAGGAAGAGAACGCAGCTCTGTCGTCTAAGCTGAGCGAGGTGTCGCGGGCTAAGAGCACGGCAGAGGCGGCAGCAGCGGCCGCAGCAGCGGAGGCCTCCGCCGCCCGGGACGAAGCTCGCCTGGCGGCACGTCGTGCGGCAGCCTTACAGGAAGCGATCGCGGGGGATCTCTCCAGCCCCGGGGACTCCAGGGACACTGACAGCGACAACGACAGTTACAGTTCTGACGAATCGATAGGGACGTTCCTCGCCAACCACAAGCTGAGCCCGTCAGTGCCTTCGCGTGCCAGTCTTCATCTAGACTCACAGAAATCCCAAAGCCCGGAAGGACGTCAGAGCAGGTCGAGCGTCGGTTCTAGCACTAAGTTGAGTCCAACGAAGGAATCGTTCGCATAA

Protein sequence:

>DPOGS207001-PA
MPHVSSPSRTLEGKTGKPISIKIPEQRTNGSNGILSPNSNGPLSPPILDATPKKPPKAPLGRAILPVKKSKTLPINFEEKVPPKLSSGFLSGKKFVVTKPLKEVKTIPQQTENGNYLSTRSLSPACKRSTSVTIETGSIAELARRHGSNNILTSRSPSPSSYGSVKSSGSYYSTCSSNSTFVPLSRSSSLTSTLYNRTPTPRRMYPQTSDNSDNVDLKELTSKEQAPVVFDANLSFVLGCKKQPVRQKFKPIAAHLEEGTSSNYLSTKIDNFLKRTDHVMDEWRRMGHKDEPDLEMYYDGKKRKIGRSKSATNIMIKGFTLFSRSGSRASSVCRSSRGISEDRTTVSEMDELSEVGGELSDERAAAALATERADAEAAERLRIERDNRELLANNQRLQQASERLELELLHWRSAENGNAEPSDSEGSEADGSASGDKYKRRFERAHRELQLLRAQLRRQHEDDLEQLVTVKKQLEKKVQDAYEEVEEQRAVAAQWKRKLQKLTNDMADLRSLLDEQTCRNNLLEKRQRKFDAELHSAQEELKRERAAKERLSRERDQAHAEKYALEQSLSEARMEVELKEERLLSAARELEERGGGDEVAALRRTRGELERRVRDQEEELDELAGQIQLLESSKLRLEMLLEQQRKEARLEAAARDDEMEETRANASKKLKMLESQLESEHSERSLLLRERHELERRLAALEEAARQETHEQGQLVIRLKRDVKRYRALLRDAQTMLEQKEKEGGGKTQIRHLKNQVEDLELSLRAANKARSTAESEASEATAALEETARARNEAVERAHAATRDAAAARAALDDAEEEAAELLKKYRASTSALCAAQAAAREAESRAEAAAEEARQAREKLTEMTTRLAHAEAGHTHEQHEAGRRLELRNKELESSLELEATSRARLEGQLARLRDSHEQLASELSAARAKDHQAAEEVRKLTRQLRELKEENAALSSKLSEVSRAKSTAEAAAAAAAAEASAARDEARLAARRAAALQEAIAGDLSSPGDSRDTDSDNDSYSSDESIGTFLANHKLSPSVPSRASLHLDSQKSQSPEGRQSRSSVGSSTKLSPTKESFA-