Monarch geneset OGS2.0

DPOGS208106
TranscriptDPOGS208106-TA1800 bp
ProteinDPOGS208106-PA599 aa
Genomic positionDPSCF300395 + 84461-86615
RNAseq coverage112x (Rank: top 59%)
Annotation
HeliconiusHMEL0142674e-17854.82% 
BombyxBGIBMGA001670-TA8e-11144.31% 
DrosophilaMeics-PA6e-1424.37% 
EBI UniRef50UniRef50_UPI00022339C08e-1628.21%UPI00022339C0 related cluster n=1 Tax=unknown RepID=UPI00022339C0
NCBI RefSeqXP_002429661.13e-1526.18%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2608052025e-1723.89%hypothetical protein BRAFLDRAFT_80515 [Branchiostoma floridae]
NCBI nr blastxgi|2607851633e-2326.21%hypothetical protein BRAFLDRAFT_267706 [Branchiostoma floridae]
Group
KEGG pathway 
Orthology groupMCL34732 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208106-TA
ATGCCATTGATTTGCAATGGAGACAATGAGGAACTATTGGAAATCGACGATAGTGCTGTGGAATTTCAGAGGAAAAAGAGAAGTCTATCCACAGAGGAGACATTTCGCGGCTTCGAAAAAGTGCCTCTGGTACTTTGTCAGCGAATTGATATAACTCCTTATATAAATAAACATCAACATCCCAAGGCGCCCGACTTGAAAGATAATATGAGGCAGGTGGAAAACTCGAATAAATGCGACATGAGCAAAATATTTACAGATTGTTCGGTAGTTTTAGTGAGGGAGGATCTTAGCCGTTTAAAGGAAATGCTCAGTCAACAGAACGAGAATATTAAATGCAGGATATGCGACAAGGGTTATCCCAGCGAGAGGAAACTCAACAACCACCTGGAGAACAAACACATGACAGTGAAGACTCCCAAACGAGTCTCCTTCTCCGAACACATAATAGTTCACGAAGTTGAAGAATACCATCGATGTAGGAAGTGTCCGAAGATATTTAAAGACTACAACACCCTAAAGGTCCACATGCGACAGAACCACAAGAAACGGAAATGTTACATCTGTCACTATTGTAATAAAGACTTTGTGGACAGAACCTTCTTTAAGGTTCACATTAAACTGCACTGCGACGCCTGTGGGCTGTTGCTACCAAATAAAAAGTTATATCTGGAGCATAGAGAGAAAGTGTGCAGAGTGGTCAAGAAGTACGAGTGTAAGACCTGTGTGAAACACTTCTTCCGCTTCATGGACCTGAAGGACCATAGCTACGAACACCTGGGAACCTTCTTCATATGTGATGTGTGCAAGCAGCTGTTCCATAACAAATGTGAAGTGGCGCACCACATCATGTACTCACATTCAAAGGAACGTCCAACATCCAGCTACACGGAAGCATCGGGTTCCTTTACATGTTTCTTCTGCAATACGAGCTTTAATACCCTGGAAGGTATAGAAAAGCATGTCGGTGAGTTGCCGGACTTGCAAAATACAGCGACCACTTATTATAATGATTATCATTTCTGTGATCAATGCAGTAGGAAGTTCGATGTGGAGTCCGACATGCTCCAACATAAATGGTCACACTTCCTCAAGAGCACTGATAACTCACAAACACAGGAAGATAATGATTTAATAACTGATAACAGCACTCGGTTAAAAACGACTTATAAATTGGGAGAAGAAATACCTGTGAGTCTGCAACCAAAGTTGGTTCTCGAGAGGATAGAACTTCCTAAGAGAGCCAAGATGAAGAAGTTACCGAGTCGGCCGGAGAGTTTCATAGGTGTGACCAATGCATCGATAGGAACATTAAAGAAACCCATCATAGACCCCGTCACCAAGAAGACCATACTGTCAAAACATAAATGCGAGAAATGTGGCAAGTACCTGTCATCCAACTACTGTTTGACACGACACATGAGAGAAGTCCACGGCATCGGCAAAGTAACATATGACGAGGATCTCCAGTGTTACTTGTGCGAGGAAGTTTTTTTCTGGCCCTCGCTGCTCCACAACCATCGCTGCATAAGAAGCAAAATACCAGAGATGCCATTTGATGATGCTCGTCCTGAGATACACTTTGATAATTACGAGGAGTCCCTTCACAATGATAACGAGGACTTCATGAACATGGACTATGAGATGGCTTCACCAATAGTTCAGTTGACGGAGTATGAAAATCTAAATATTGTGGTCAACAATGGGAACGGTCGGCTGGATGTTATAGACAATGAGAAGAACCAGATGAACAGGTTAGGGTTCAAAGTGGTGATGCAGGAAGTGCCCATTGAGTTCTAG

Protein sequence:

>DPOGS208106-PA
MPLICNGDNEELLEIDDSAVEFQRKKRSLSTEETFRGFEKVPLVLCQRIDITPYINKHQHPKAPDLKDNMRQVENSNKCDMSKIFTDCSVVLVREDLSRLKEMLSQQNENIKCRICDKGYPSERKLNNHLENKHMTVKTPKRVSFSEHIIVHEVEEYHRCRKCPKIFKDYNTLKVHMRQNHKKRKCYICHYCNKDFVDRTFFKVHIKLHCDACGLLLPNKKLYLEHREKVCRVVKKYECKTCVKHFFRFMDLKDHSYEHLGTFFICDVCKQLFHNKCEVAHHIMYSHSKERPTSSYTEASGSFTCFFCNTSFNTLEGIEKHVGELPDLQNTATTYYNDYHFCDQCSRKFDVESDMLQHKWSHFLKSTDNSQTQEDNDLITDNSTRLKTTYKLGEEIPVSLQPKLVLERIELPKRAKMKKLPSRPESFIGVTNASIGTLKKPIIDPVTKKTILSKHKCEKCGKYLSSNYCLTRHMREVHGIGKVTYDEDLQCYLCEEVFFWPSLLHNHRCIRSKIPEMPFDDARPEIHFDNYEESLHNDNEDFMNMDYEMASPIVQLTEYENLNIVVNNGNGRLDVIDNEKNQMNRLGFKVVMQEVPIEF-