Monarch geneset OGS2.0

DPOGS201597
Transcript: DPOGS201597-TA, 3801 bp
Protein: DPOGS201597-PA, 1266 aa
Genomic position: DPSCF300152 + 61817-84054
RNAseq coverage: 10x (Rank: top 84%)
Annotation
Database        Hit                     E-value  Identity  Description
Heliconius      HMEL008115              3e-98    58.97%
Bombyx          BGIBMGA012210-TA        0.0      60.96%
Drosophila      CG15145-PA              3e-12    25.14%
EBI UniRef50    UniRef50_UPI0001CB9EC3  1e-55    29.77%    UPI0001CB9EC3 related cluster n=2 Tax=unknown RepID=UPI0001CB9EC3
NCBI RefSeq     XP_967867.2             9e-57    32.25%    PREDICTED: similar to AAT1-alpha [Tribolium castaneum]
NCBI nr blastp  gi|198437481            3e-57    30.74%    PREDICTED: similar to predicted protein [Ciona intestinalis]
NCBI nr blastx  gi|189242307            1e-64    33.39%    PREDICTED: similar to AAT1-alpha [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL17032 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201597-TA
ATGAGCGGTGACACAATATACGAAGCACCTAGAAGATTCAATGTCTCCAAAATGAGAGTCCACGACTATTTATACGATTCAACGTACATCGTGTCCGGAGCGAGGGATTACGCTCGAACAGCGTTCAAAGCGGCCATGGCTTCCGCGCAAGTGGTTATCCAACCGGTCTACAAGACAATGTTCTCAGAACTGCGCCAATTCCCTAGAATGCAAGTCGTCTACCACCCCAACTGTCGTTTACCCGCGCATATAGATCGCTCGTACAGAGCTTACTTGGAACGAACAAGAGGTGAAAGGGCCGGACCGCCCCAGCTTGGAGGCAAAGATCGCTTCAAATTCTCAGCCGTGCCGAAAAAGCTTCTGAATCTCAAAAGCGCCACCCCTGACTTCCACCCCACAACGCCTCGTCAGCGGCGGACCAGTTCAAAGCCAAGGAACAGGGGCACGCAGAGTTTGTATCGCGAATCATCAGCTCAGACGATACCCTGGGGTCCTGACGCCCGGCCGGCGGATAATGTCGAGGAGACTCCAGAGGTTATGTACCTTGATGCTTTAGAGTGGGGCCCTGGATGTCCGTACCGACTTGGCGACCTACCAGCTGATTTCCATACAACGGAAATAATTAATAAAATGCGGCACGCGAGAAAATGGACTGAGATCGTCGAGAAGGGCCAGTTCCCGAACTTCATGAAGAAACAGAACGCCATCATCACGGACATTGAAACTAAGGATTGGATATTTAGAGAAGCAGAGATAGATGAACTCCAAGACATCCGCCTGGCGCTCCTCAACAAGCTGAAGGCTGAGCAGCAGCAGAAGAGGAACAGCCGCATCAGCAGCAAGCTGGCGAAACTGTGGGCGGATAAGAAGGACGCGATGGAACAAAAGATAGACAAAATACGACGGACGAGGGACAGAGAACTCAGGAAGTTAAGTTCCCGGGGGCGAGGTCGCAGAGCGGTGGATTCTGAGCACGCGCCACTAGCGAGACTGGGATACCGTGCTAGCCGCAGACACGCTGAGATCGTCTATGACCCGTCCTTACTCGTTCACGAAGACCATAGAAAATTAGCGGAACCGCCGGCATGGCTGGAGCAGTGCGGGCAGAACCTCACTAAGACCTGCTCGGGACATCACCTGCCGCGTGACGTCACTCAACTCTGCGAGCGGGAGACCAAGTGGAGCGAGCAATTCCTGGCGAACTTACACTCTGACCTTAAGAAAGCGAGACTCGGCGCTGCAAAAGTAACAGCTGGTCCACTGCGGGTGCTCCGCCCCCGTCAGCTGCCTACCATCGCTCGACCTCCCACACCGGAAGTGGAAGGAGTTGAGGACAATGATGAATCAGTACACCAGGCGGCGCTAGTTCTGCAAAGAGTGATAAGAGGCAGAGCTGTCCAAGTATTGATGTTCGAAGGACGCACCAGGGCTGGAGAACTCACCGAAGAGTTGAAGACAACGCACGGTCTCCAACGAGAAGACAGAGAGAGGATCGCTAGGGAGGAGTCCAAGGCGAGGGACTATCAGGCGCTGCGCTCTGAGACCGAGCAAAAGGAACAGGCGATCTCATCACTGGTCCAAGAGCTGTGTGGGGGCGCGGTGTCAGCGGCGCTGGACTTCCTGGAGAAAGAGCTGCGGAGGTTGAGGGAGGAGAGGCGACAGCACGCCTTCATCCTCATCGCGCAAGCGAGACTCGGCGCTGCAAAAGTAACAGCTGGTCCACTGCGGGTGCTCCGCCCCCGTCAGCTGCCTACCATCGCTCGACCTCCCACACCGGAAGTGGAAGGAGTTGAGGACAATGATGAATCAGTACACCAGGCGGCGCTAGTTCTGCAAAGAGTGATAAGAGGCAGAGCTGTCCAAGTATTGTGTGTGTGTGTGTGTGTTATCACAGAACTCAGGAAGTTAAGTTCCCGGGGGCGAGGTCGCAGAGCGGTGGATTCTGAGCACGCGCCACTAGCGAGACTGGGATACCGTGCTAGCCGCAGACACGCTGAGAT
CGTCTATGACCCGTCCTTACTCGTTCACGAAGACCATAGAAAATTAGCGGAACCGCCGGCATGGCTGGAGCAGTGCGGGCAGAACCTCACTAAGACCTGCTCGGGACATCACCTGCCGCGTGACGTCACTCAACTCTGCGAGCGGGAGACCAAGTGGAGCGAGCAATTCCTGGCGAACTTACACTCTGACCTTAAGAAAGCGAGACTCGGCGCTGCAAAAGTAACAGCTGGTCCACTGCGGGTGCTCCGCCCCCGTCAGCTGCCTACCATCGCTCGACCTCCCACACCGGAAGTGGAAGGAGTTGAGGACAATGATGAATCAGTACACCAGGCGGCGCTAGTTCTGCAAAGAGTGATAAGAGGCAGAGCTGTCCAAGTATTGATGTTCGAAGGACGCACCAGGGCTGGAGAACTCACCGAAGAGTTGAAGACAACGCACGGTCTCCAACGAGAAGACAGAGAGAGGATCGCTAGGGAGGAGTCCAAGGCGAGGGACTATCAGGCGCTGCGCTCTGAGACCGAGCAAAAGGAACAGGCGATCTCATCACTGGTCCAAGAGCTGTGTGGGGGCGCGGTGTCAGCGGCGCTGGACTTCCTGGAGAAAGAGCTGCGGAGGTTGAGGGAGGAGAGGCGACAGCACGCCTTCATCCTCATCGCGCTGCGTGAGAAGACGATGCGAGAGGCAGCAGAAGCCGGCAGAAGACAGAAAGAAGAACATCGCAGAAGAGAACATGACGAGATCTTTAAGAGGGTGTTGGGAGTAACTCAGGAAACTGTAGACGCGTACCTTCAAGATGTGTTGCTAGAGGGCGTTGCTCTCGCTGCGGAGGAGGATGCTGTGAGGAATGTCCTCTCGTCCGCTGATAGGATGGACAAAGAACTGGCTGCATCCGGTTCGATATCGACGGCGGAACAAAACGAGCTCGTCGCTGAGCTCGTACAGCAGTTCCTGCTACCTGAAGCTCACAAGACGGCATCTAGGCACAAGATAACCACGATACAGAAGGGAAGGCTAGAAGCGGCGAGAGCGAGCATCATGGGCGTGTTGATAGACGCTGAAGATGTTGAGGCGACCTGTCCTCGCTGCGGGAGACCTTTGGACGATCAGTACAAGTGCAGCGTCTGCAAACCAACTGCGACCACAGCCAGAGACGATCCCAGGTGGAAGCACACAAAGCGTCGGGAAGTCAAAGAGAAAATGTTGTCAGAACGATATCCAGCTAGTCACGAGATACGATGCATGTTGAACGCTCTAGTTTATGATGCGGTCGAAACGTCGCGTTCCGAACGCCGGGAGAGGGAATCCATAACGAGATTTATAGCACGGCGTCTGAGAGAAGACACAGAGATAAACATAGACGCCATAGAGATGGTCAATGAAGCGATCGCCAGAAGCACCGGCGAGGTGGTCGTCACCAAGCGACCTGATTATCATCATTACATGACGAGGATCTGTGAGGATGCTCTAGCAAGGACCCTTCCGTACGCCACTCCGCCATGCCCTAAGGAGCTGCCGTCCGAGATAAGGAGGAGAGCGGAGGAGGCCGCGGCCTTGGAGGACCCGACCTGCCGCTGTGACGAAGATAAGAGTAAAGTGAAATTCGGGATCAGTGTAATGGACGCGAAGGAAAAGTCAGAACTCCTGCCATCGGAGCTGCGACTGCTGGAGGAGCTGCGGCGGTGCAAGTGTGATACGAACCCGTCACCGAGCCCCATCACCGTTTCGTCATCGTCAGACGGTTCCAGCAGTTCCGACTTCACCGACGCCGTCACGGAAGAGGAGGGAAATGAAAATCAATAA

Protein sequence:

>DPOGS201597-PA
MSGDTIYEAPRRFNVSKMRVHDYLYDSTYIVSGARDYARTAFKAAMASAQVVIQPVYKTMFSELRQFPRMQVVYHPNCRLPAHIDRSYRAYLERTRGERAGPPQLGGKDRFKFSAVPKKLLNLKSATPDFHPTTPRQRRTSSKPRNRGTQSLYRESSAQTIPWGPDARPADNVEETPEVMYLDALEWGPGCPYRLGDLPADFHTTEIINKMRHARKWTEIVEKGQFPNFMKKQNAIITDIETKDWIFREAEIDELQDIRLALLNKLKAEQQQKRNSRISSKLAKLWADKKDAMEQKIDKIRRTRDRELRKLSSRGRGRRAVDSEHAPLARLGYRASRRHAEIVYDPSLLVHEDHRKLAEPPAWLEQCGQNLTKTCSGHHLPRDVTQLCERETKWSEQFLANLHSDLKKARLGAAKVTAGPLRVLRPRQLPTIARPPTPEVEGVEDNDESVHQAALVLQRVIRGRAVQVLMFEGRTRAGELTEELKTTHGLQREDRERIAREESKARDYQALRSETEQKEQAISSLVQELCGGAVSAALDFLEKELRRLREERRQHAFILIAQARLGAAKVTAGPLRVLRPRQLPTIARPPTPEVEGVEDNDESVHQAALVLQRVIRGRAVQVLCVCVCVITELRKLSSRGRGRRAVDSEHAPLARLGYRASRRHAEIVYDPSLLVHEDHRKLAEPPAWLEQCGQNLTKTCSGHHLPRDVTQLCERETKWSEQFLANLHSDLKKARLGAAKVTAGPLRVLRPRQLPTIARPPTPEVEGVEDNDESVHQAALVLQRVIRGRAVQVLMFEGRTRAGELTEELKTTHGLQREDRERIAREESKARDYQALRSETEQKEQAISSLVQELCGGAVSAALDFLEKELRRLREERRQHAFILIALREKTMREAAEAGRRQKEEHRRREHDEIFKRVLGVTQETVDAYLQDVLLEGVALAAEEDAVRNVLSSADRMDKELAASGSISTAEQNELVAELVQQFLLPEAHKTASRHKITTIQKGRLEAARASIMGVLIDAEDVEATCPRCGRPLDDQYKCSVCKPTATTARDDPRWKHTKRREVKEKMLSERYPASHEIRCMLNALVYDAVETSRSERRERESITRFIARRLREDTEINIDAIEMVNEAIARSTGEVVVTKRPDYHHYMTRICEDALARTLPYATPPCPKELPSEIRRRAEEAAALEDPTCRCDEDKSKVKFGISVMDAKEKSELLPSELRLLEELRRCKCDTNPSPSPITVSSSSDGSSSSDFTDAVTEEEGNENQ-
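
The record lists a transcript length in bp and a protein length in aa, and the protein FASTA entry ends in "-" where the stop codon falls. The sketch below shows one way to cross-check such a record: read the FASTA entries (joining sequences that wrap over several lines), translate the coding sequence with the standard genetic code, and compare lengths. It is a minimal illustration, not the database's own pipeline. The function names (read_fasta, translate, expected_cds_length) are hypothetical, and the sketch assumes that the transcript is a pure coding sequence starting at the first base and that the standard codon table applies.

```python
from typing import Dict

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def read_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA text into {record name: sequence}, joining wrapped lines."""
    records: Dict[str, str] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" mirrors the protein entry
    above); codons containing unknown bases are written as "X".
    Trailing bases that do not fill a whole codon are ignored.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_cds_length(n_residues: int) -> int:
    """Length in bp of a CDS encoding n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


if __name__ == "__main__":
    # Toy record (hypothetical name) using the first four codons of the
    # transcript above followed by a stop codon, wrapped over two lines.
    toy = ">toy-TA\nATGAGCGGT\nGACTAA\n"
    cds = read_fasta(toy)["toy-TA"]
    print(translate(cds))
    print(expected_cds_length(1266))
```

The stated lengths agree with each other under these assumptions: 1266 residues plus one stop codon give 3 x 1267 = 3801 bp. To check the sequences themselves, pass the full text of both FASTA entries to read_fasta and compare translate(...) of the transcript with the protein entry.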