Monarch geneset OGS2.0

DPOGS207750
TranscriptDPOGS207750-TA3579 bp
ProteinDPOGS207750-PA1192 aa
Genomic positionDPSCF300042 - 623706-628497
RNAseq coverage192x (Rank: top 48%)
Annotation
HeliconiusHMEL0119710.044.77% 
BombyxBGIBMGA005305-TA2e-12944.44% 
DrosophilaCG1832-PB2e-0723.61% 
EBI UniRef50UniRef50_E0VMU73e-3534.29%Zinc finger protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VMU7_PEDHC
NCBI RefSeqXP_002427441.15e-3634.29%zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420134971e-3434.29%zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420134972e-3833.10%zinc finger protein, putative [Pediculus humanus corporis]
Group
KEGG pathway 
Orthology groupMCL18303 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207750-TA
ATGACGACAGAACAGAAATATGATTGTGTATTTTGCAAAGAGATATTCGATGATAAAGAGGCATTACAAATACATTTCAGAAAACATGGGGATCCAAAATTTAATAAAATATCAAAGTCAAGAGGGCGCACATCGAATGAAGAATCAAGTACTGAGAAACCGGTAGAGGAAAATGAAATGGTTGGATGTGATGTGTGTGAAGAAGTGTTCCCAACTATATCGAAAGCAATAACTCATAAGCATAAAGTACATCCAGACCACGATGCGAAATACTTTTGCTCATTCTGTGGTAAAGTCTTTACAATGAAGCATCTTTTCAATAAACACATTCAAACAAACCACGATGGTGAACCCACAAACGACACAAGAGATTTTTATTGTGAGTGCTGTGAAGTTGCATTCTATGTAGCACCAGCTATGCTGTATCACAACAAGTTCTTCCACAGACAGGACTCGGAGCTGCCGGCTATTGGTCAATCAAAGAAAGTAAAGCTATACAATCAGGAATTGCTACAAATATTCTATTGTGCGTTCTGTGGTGAGGAATACAACAATAAAATCAATCTACATAAACATATGGGCGATGATCATGCTGATGAGCATCAAAGTCCGACCGAGGTGTTGCGATGTCCGCTGTGTGAAGCCATCTTCTATCACTTGGATGCGTTCGAGGTTCATCTGACTTTCCACACTACTGAGGACTTGTACAGTGAAAAGAATGAAAGTGCGGAAGGAGTCACAGAGTTCTCATTGGAAACAGTACCACCGATAATGGAGAAAGTCGAAGATGACCAGCAACCCGAAGACAATATGAACGAAGAAGGAATTGACAGTTTTCTCCAACTCGTCATGGGCGAATCCGAGGAGCCGGAAAAAGTTAAAGTAAAGAAACACAAAAAGCACAAGAAATCAAAGAAATCAGCCATAACACTAGACGAGTTCTTAAACATGAATAAAGATGTGTTCGGCGACGGTCTGGATGTACAGGGCATCGAGGAAGTACCGACGCCGTTCGTACTCAAGAAACCTAAAGTCAAGAAGGTCGTAAATAAGGTTGTGAATGCAGATTTAGCTAAATTGAAGAAAATAGGCATAACAGTTAAAACGAAAGTCGCCAACCCTGTCGCTAACATTAAGGGCGTGGCGGCGAAAATAAGTACACCCAACACTAGCAATAGAAACAAAGTGAATTCATCAAGTTCTCCAAACGAAATTATATCGAAATTAATGAATCAAGGAAATAGTCAGATAAAAATAATCAAGAAGACTGTACCGCAGAATGTTGCGCAGAATATTATTCAAACTAATGATCAAGAATCGGTAACTAAAGCTCCTGATGTAGGACTAACTAAAGATGTCAATGAAGGTCTAAATCTAAATCAGTCCATTGAAAATGAAGCCAAAGAGGAAAATAATAGTGTTGTCGATGACAATACAGACATAGAAAGTAGTAATGTCCATAAAGATGTTTCAGAAGTAGGGGAAACTACACATTGTAAAGAGACCGACTTGGATAATATGACAAATATACCAAAGTCAAACAGTAGTGTAAAAGAAGTCGATCGCACGGAACCAAAAACATCAGGACAGAATAATACAGAAGAGATTTTAGAAGGCGGCGGTAATATATGTGAGGCGACCGACAATTTTCAGCCCAGTCAAAAAAAAGATGACATTATAAGCGAGTCCACCAGTTCGTGTATTAATACAAGTAAAGAAAACGAAGGTGACATCGAAAGAGACGTGGATAATGTAGCTTTAAAGACTTTAAACGCTTTAAAACATTTAAGTCATCTATTAACAGTGAAACCTGTGGTGAATAGTAAGAGTGTTCTGAAAACGAGCGAAGCTAACAATGTTAACAAGGAAGCCGTCGAAGCTGCGAAGGAAACGAAACTAGATAAACCGCTGAGAAATTTATCGGAACAAATCACAATAAAACAGCCAAAATCTCCGTCGGTTAGCGCGAACATGGCGCCTGATGGTGACAATGAGAACGGTCACCCGGGCAGTGATGTCGACTGTGATGATGTCTCGCAAAGTAACGACGGTGCAAGTAAAACTAACGCACCAAACCGAGTTCATTCACCCTACACAGAGAGGACCTCGACACCAACGTCAACGAAGTCAAATAATTCAGTCTGTAATAAATCATCCGACGCAATTACTAAAAGCGAGTTATGTCCTAAAAAAATTGCAAATCTGAATATACTCAAACGTCTCACAAACGTCACAGCGAAACCGCTCGGTAAAGCGAACACGCGGTCACCGAATAACATAGTGAATAAAAATATTACAACTACAAATATCAAACAGGAAAAGGGTAAAATTTACGAAGAAATTGAAGTTTTTAATATTGACGATTCCGATAGCGAAGACAACGAGCAAACGGAAGTGGTGTCGGACGACGTGAAAAAAAATAATGTGCCCTTGGATGCCTTGAAAAGTTTAAGCAAAAATATCACCGTGAAGAGCAGTCAACTAACAAACTCGAAAGTTACGACGTATAGAGCCGAAAGTAATTTCGAGAATTATACAAAATTCAACAAACAGACGCCGGAAATACAGAAGAGCATCAATTTACAAAATAAACTCAACAACTTCGGAAGTCATATAAGGGTGAAATCGAGAAACTCATCGCCGATAAATAAAGATAGCAGAGATGTTAGTGACGCTGACGATGAAAACTATGAACAAGATTTCGACAGTGGGTCGGACGATGAAGGCGATGTGAGAATTACTGAGGTCCATGATGACCCTGGAACAGATGGAGAAGGACATGAAAACGACACGAACGCCACGGTGCAGTCACCGTCCGGGACACACAGCGACCAGGAACAGGTCAATGAAGACATCGCCGACAGAACATACGATAACTCAAAATCATCACCGTTACCGAGACCAAATGATGCGCCAACAAAGAAAGCCCCAGGTCTAGAAAGCCTGAATTTAAACAAGGAACTCACAATTAAATCGCTAACGAAGAATTGTGATAATGACGAAGGTTCAAATCATTCCCGCCACAGAGATGTCACTGAAGACACTAAAAACCCTAGTAAGGAACTCGTGCTGAAGCAATTCAGACAGAACACGACCAACGAGCAACCGACGAACAAGACATCGCAGCCGACCGGCGGGTCCGCGAGTCAGGCGTTCAACCAAAAAGTTTCTACATCATCAAACCAAGTAACCAAAACTGTGAAGAGGTTCCAATCGCAAACCATCATCGAGGAGATCACGACCACTGTCACTAAAACAATCCGAACAGTCAATCAATCTACCAATGAAGAAGTACAGAGCACCAGCCAGCTAGCGCCCAAACCTAATGCGAGACCTCAAAAAATCATAAACAGATTGCCCCAACCGGGCAGAGAATTCCAAGGAGCGACGATCAGGCAAATCACACCAACGGTGGGAACAAAAATTAGAAGTACGGGCACGGCGGTGAGATCACCAAAACCGCTGGTGCGTCCTTCAAATCAACTGGTTCCAATAAGGCCGTCCAATTTGATAAGACCCACATTACCAAGTCATAGGAAAATTAAAAATTTCCCCACACGCCATCAGTCAGACTGTTAA

Protein sequence:

>DPOGS207750-PA
MTTEQKYDCVFCKEIFDDKEALQIHFRKHGDPKFNKISKSRGRTSNEESSTEKPVEENEMVGCDVCEEVFPTISKAITHKHKVHPDHDAKYFCSFCGKVFTMKHLFNKHIQTNHDGEPTNDTRDFYCECCEVAFYVAPAMLYHNKFFHRQDSELPAIGQSKKVKLYNQELLQIFYCAFCGEEYNNKINLHKHMGDDHADEHQSPTEVLRCPLCEAIFYHLDAFEVHLTFHTTEDLYSEKNESAEGVTEFSLETVPPIMEKVEDDQQPEDNMNEEGIDSFLQLVMGESEEPEKVKVKKHKKHKKSKKSAITLDEFLNMNKDVFGDGLDVQGIEEVPTPFVLKKPKVKKVVNKVVNADLAKLKKIGITVKTKVANPVANIKGVAAKISTPNTSNRNKVNSSSSPNEIISKLMNQGNSQIKIIKKTVPQNVAQNIIQTNDQESVTKAPDVGLTKDVNEGLNLNQSIENEAKEENNSVVDDNTDIESSNVHKDVSEVGETTHCKETDLDNMTNIPKSNSSVKEVDRTEPKTSGQNNTEEILEGGGNICEATDNFQPSQKKDDIISESTSSCINTSKENEGDIERDVDNVALKTLNALKHLSHLLTVKPVVNSKSVLKTSEANNVNKEAVEAAKETKLDKPLRNLSEQITIKQPKSPSVSANMAPDGDNENGHPGSDVDCDDVSQSNDGASKTNAPNRVHSPYTERTSTPTSTKSNNSVCNKSSDAITKSELCPKKIANLNILKRLTNVTAKPLGKANTRSPNNIVNKNITTTNIKQEKGKIYEEIEVFNIDDSDSEDNEQTEVVSDDVKKNNVPLDALKSLSKNITVKSSQLTNSKVTTYRAESNFENYTKFNKQTPEIQKSINLQNKLNNFGSHIRVKSRNSSPINKDSRDVSDADDENYEQDFDSGSDDEGDVRITEVHDDPGTDGEGHENDTNATVQSPSGTHSDQEQVNEDIADRTYDNSKSSPLPRPNDAPTKKAPGLESLNLNKELTIKSLTKNCDNDEGSNHSRHRDVTEDTKNPSKELVLKQFRQNTTNEQPTNKTSQPTGGSASQAFNQKVSTSSNQVTKTVKRFQSQTIIEEITTTVTKTIRTVNQSTNEEVQSTSQLAPKPNARPQKIINRLPQPGREFQGATIRQITPTVGTKIRSTGTAVRSPKPLVRPSNQLVPIRPSNLIRPTLPSHRKIKNFPTRHQSDC-