Monarch geneset OGS2.0

DPOGS215447
Transcript: DPOGS215447-TA | 5112 bp
Protein: DPOGS215447-PA | 1703 aa
Genomic position: DPSCF300298 + 231531-249430
RNAseq coverage: 3018x (Rank: top 4%)
Annotation
Heliconius: HMEL016318 | 0.0 | 59.02%
Bombyx: BGIBMGA005738-TA | 0.0 | 54.12%
Drosophila: Ank2-PU | 1e-146 | 46.13%
EBI UniRef50: UniRef50_UPI00021A8889 | 2e-167 | 49.84% | UPI00021A8889 related cluster n=4 Tax=unknown RepID=UPI00021A8889
NCBI RefSeq: XP_392578.3 | 8e-165 | 52.86% | PREDICTED: similar to Ank2 CG7462-PB, isoform B [Apis mellifera]
NCBI nr blastp: gi|345488337 | 1e-167 | 49.04% | PREDICTED: hypothetical protein LOC100122475 [Nasonia vitripennis]
NCBI nr blastx: gi|345488337 | 9e-175 | 28.40% | PREDICTED: hypothetical protein LOC100122475 [Nasonia vitripennis]
Group
KEGG pathway:
InterPro domain: [131-235] | IPR000906 | 1.4e-28 | ZU5
Orthology group: MCL25162 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215447-TA
ATGGGTGTCGTCAAAAATAAAAAGAGCAAAACGAAGAGCGATGATTCTAAAAAGGATGTTAAAAAACAGAAAATTGATGACGGGAAAATGAAAAAAAAGAAAAAAGTGTCGTGTTTACAATGCTTCAAGTCGCCAGATGATGGTGGTTTAGAGGGCAAATATCGAGTGGCTGCACCTGAGCTCATGCAGGACACGTTCATGAGCGACTCTGAAGACGAAGGCGGAGAAGTAGAGTGTCCAATCCAACAGCAACAACAGTACAGATACATGAACAGCGAGGCGGGTACGTTGAACCGGGCTAGACCGCTCGAGGATAGTGTCACGGACGGACATCTGTGGCCCAGCAACAATGACAAGAGAGTCGCTACCATGGAGAGACAACCGCTTGATATCGGGTTCCTGGTATCTTTCGTTGTGGATGCTCGTGGTGGTGCCATGAAGGCGAAGCGTCGTGGTGGTGTGCGAGTCATAGTACCACCAGCAGCCTGTGCCGCTCCCACGAGGGTAACATGCCGTGCTGCAACGCGTCGCGCACCCGTCGCTGCCCCACCACCCCTGATGGAGGGCGAGGCCCTAGCCTCACGCCTGCTAGAACTACAGCCGCAAGGAGCGAAGTTCCTCGCGCCCGTGATCATCGAAGTACCTATCTTCACCGCGTCCTGTCCTGAAAGGGAGATCGAGGCGCTAACTCGTGAAAAACAAGAACACGGTCCAGGCGGAGAGAGACTCGGAGAGAATGGAGAGAGAGTCACGCGTATCATAACATGCGACTTCCCTCACTATCTGGCGGTAGTGTCACGTGTGAGGCAGGAGGTGCACGTCATAGGTCCCGAGGGCGGGACCGTGTCATCGGCACACATACCACAAGTACAGGCCTTATTCCCGCCGTCGGCCCTCACGAAGAGGATCCGGGTGGGGTTGCAGGCTCACGGCGCGGATAAGGCGCTCTGCTCGCGACTCCTGCCTCGTCACGCGGCCGTGTCCCCCGTACTAACTGTGGAGCCCCGGAGAAGGAAGTTCCATCGCACTATAACGCTGACGGCGCCATTGCCGCAGCCGCACGACCATAAGCTGAACAGTCACGACAAGCAGTCAACAGCAAACCTTCGTCTTTTATGTTCAATTATGGGCGGTCAAGCGCGAGCGGTTTGGGAGGACGTGACCGGGTCCACTCCGCTCACCATCACCGACGACTGCGCCTCCTTCACCACCACTGTATCAGCCAGATTCTGGTTGATGAACTGTCAAAATGTCAGTGACGCGACCAAACTGGCCACGGAACTATACAGAGAGATGTTACTGGTGCCTTTCGAGATACGTATAGTTGTTCTTGGTAAACGTTTGGATTCGCTGGAAGGTCGCCTCTTGGTGCTGTACATCACGGACAGATACGCCTATGACACGCTGCTACATCAGGAGCACTATACTGAAGTGGCTCATTCCACGTCCGTCCGTCTATTGGACGGTCGTCCTGTATATCTGGAGTTCTCTGGTAACCTGGTTCCCGTCACCAAGTCCGGTACTCAACCAACTCTGGTCTTCGAGGCCTTCAAAGACAACCGCGTTGAGTTTCCTGTGAGAGTGAAGCATCACGAAGAAAGTCCCTCGGGAAGGATTTACTTCATGAATGAACCAAAGGTCGCTAAAGGCGAACCGTCCGCCAGTCCTACCTGTGTGCTTGATGTGTCGCTTCCTGAACGCGTTGCACCCCGACCAGCCAAGAGTCATTTGGACTGCCTCAACTTGGATCAAAGCGGTTTCGACGCACTGAAGGATGAGTTGAGTTTCAACTGGGAACACAACCACAGCGTCGGACCGCCCACAGTGCCCATAGACCACGCTACTACACAACTCAACGGCCACGATAAATCAGTAGCTAACGGTGACGTGAAGTACGAGACCAAAGAGGATGACAGAAAAGATATGCCGCGTATCAATGGTGACGCGACGCATGATTCAAATAAGGCTAAAGCAACATTTGATTCCGATGAAGACAAAAC
ACCAATGAACACTTTGGAAAAGGGTAAAAAGAAAGCTGAAGGCGGTTCATTTTTGGGCGGTCTAGCTGATAAAGTCAAGAACGTATTCAGTCACAGCGAAGACAAAGAGGAAAGCAAAGAATCGTCTCCCAAACCGCAACCGAAACCGAGGGAATCTAAAGAGAAGTCTCCACCGAAACCAGCGCCAAGGATTATTAACGAAGATTTGTTGGAAAAGGTCGAAGATATGTTCAAAGACGTCCGCACAAAACCACATCTCAAGGACGAGGATGATGTTAACAGTCAAGTTTATTCAACAAAACTTATAGTTGATGAAGGAAATCCTGATGAAGTTCGTCTGAAAACTGAAGAAGTGTACCATCAGTATGAAAACATGCAGCCCGTAGACACCCCACTATATTCGAGAAAGAGCGATCCTTTCCAATTCTTTACCGGATGTTGTGAAGACAAATATAAGAAAGGTGAAAAACACCGTCACGACGAAACATCAACCTTAGCTGATAGAATCGAACAAGTCAAATTTTCTGCAAATGAACAGATCCAAGAAATCGAAGATAAGTATCAGAAGGATTTACAAAAAGATCTCAAAGAAGCGGAAAGAGAGAAAGATAATATCAAACAAGAACTCAGCGAAATCAAGGAAGATATTATTGTAAAGTCCGAGAACATTAAAAACAATTTAATTGATAAAGAGGATAAATTGAAAGCAGGTTTTGAAGATAAAGTTCAAGAGTTCAGTGACGTTGTTAACAAAAAGAAGAATGAACTGTCAGAGTCGGGAGAAAAGATTCAATCTGATATTGGAGCTAAAGTCGTTGACATGCAAACTAAAGTATCTTTGACTTTAGACAACGCGAACGATGATGTAAATAATTTATTTGACACTGTTGACAACACTAAACAAAATATAAATAATAAATTTGTTGCACTCGATCTCGCTGCTATGGAAAGAGCTGAAGAAATGAAGCTTAGGGGTCATAATAAATTATCTCAAATGGGAAAAACTTTTGAAAATGCAGCCGACGATGTCAGTCATAAAACAGAAGAAAGTATCAATGAAGCGTCAAATAAAATAAATAATTTAAAAACTGACATTGAAAACAACGCTGAAAACGTAAAAGATAAAGCGAATGAAAAGGTGAATAATGTAAAAGTAAATATTAACGACAAAGTAAGTGCCATTAAAGCTGAAGCCGAAGAAATTGTAGATGAAGTAGAAGAGAAAAGCAAAAAAGCTGTAGCTGAAGTCAAATCTAAGAGTAAAGATAAAAAGAATGCAGCAAAAAAGAAAGCAAAGGAGGGTAAAAATTTCTTTACGGGGCTTGTCCACAATATTTTCGGGGAAAAGGAAAAAATTGAGGATGAAGTTAAAGGTAAAATAGCTGAAGGCAAGGAATCTGCGAGCAAATTATCAGAACAAGCGTCGGCTGCTAAAGAAAAATTAGAAGAAAACATTGCTAAAGAGCAAGATAAGTTAAAACAAAATACAGAACAAATTATGCAACGCACAGAGAATACGGCTGAGGATGTTGTTAAAAATGTGAAGGATGCCAAGGATGATTTATTGGAAAGTGCACAAGAAAAAGCAGCGCAAGCTGTCAACGGCTTAAACAAGAAGAAAGAGGAAGTAATACAGAACGTAGAAGGTAGAGCTAACGAAATATCTACAAGTATACAAGATAAAAAAGATGAGTTATTGCTGAGTGCCCAAGAAAAGAAAGGTGAAATCTTACAAAATGTACAAGACAATACTGAACTGGTAGTTACAAATATCAAGGCAAAGAAAGATGAACTAGTACACAAAAAAGATGAATTAATACAAAGTGCGGAAAATAAAAAAGACCAACTATTGCACAGTGCTCAAGAAAAAGGTGAAGAAATATTCGCAACCATGGAATCAAAAAAGAATGAAGCGGTGCAAAAGAAAGATGAACTACTACAGAGTGCAGAAGACAAGAAAAATGATATATTGGAAATGACCAACAAAGAAACTGAAAGAA
TAGTAACAGCTGTTAAAGAAAAGAAAAATGAAGTGTTACAAACCATCGACGATAAAAGTTCGGAAGCTGTTCAAAATGCCCGAGCAAAAAAAGATGAAGTTATATCCGGAATAAGCGATAATATAGACAAAGCTAAACAAACAGTAGGTGAAACTAAAGACAATCTTATTTCTGGAATAGCGGCAAAAAAAGATGAAATGATAAATTTAAAGAACACGATGCAAGATGGCGCCGAAAACAGACTTAATCAAATAAAACAAGAAGCTGAAGAGACTACTGAAGAAATCAAAAGAACCGCCGAAGAAACTCTGAATGCTGCTGCTGAAAAGAAAATTGAGTTAAAGAGCAACATTGCTTTGAAATTCAATGACCTTAAAGATGCTGGAGAAGATAATTTAGAAAGACTACGGGATGCAGCGAAAGATAAAGCAGAGAAAGTGCAGAAATCAGCGAAAGTAACGTTCGAAGACGTCGAAGTATCAGCCTCCAGTGCCTTCGACAAAATCGAAAGTGCCGCGATGGACAAAGCTGGTGACGTTAAGGACATATATGATGACCTTCACAGTTCAACTCGAGATACATTCGCGGATCTCAAGGACGACGCTCAAGGATACGAAAGCTCAGCCCAGGATACGTTGCACAGCTTTCAGAGTTCTGCCGGAAATACTTTTGACAGTTTGGACGAATCAGCTAAGGAGTTCATCGGTGATGTCCAGAGCACCGCTGCAAGAAGCGAAGAAAGCGCTGACCAGCTGTTCGGTAATGTCAAGGACAGCTTTGAAGGATTTCAAATAGCGGCTGGCGATAAGCTTGAAGATATAAAGGTCTCTACAAAACAACACTTCGAAGAGGGTCAAGATAATTTGGGTAAAACTAAAGATGAAGTTGTTCAAGGTTTGAAGGAGTCGGAACAAGTTGTGACTGATACTCTAACTAGCGAAGCTGACAAATTCACTAGTAGCCTCAATGACCTAGGGAACTCCGTTTTTGGGATGTCGGGTAAAGGATTCATGAAGGATTCCAGTACTAAACTGCTTGAATCGGAGAAATCACAATCGTCGCCACACAAAAAGAGTTCAGGGGCCGTGTCCAGTGAAACAATTGTAACGTAG

Protein sequence:

>DPOGS215447-PA
MGVVKNKKSKTKSDDSKKDVKKQKIDDGKMKKKKKVSCLQCFKSPDDGGLEGKYRVAAPELMQDTFMSDSEDEGGEVECPIQQQQQYRYMNSEAGTLNRARPLEDSVTDGHLWPSNNDKRVATMERQPLDIGFLVSFVVDARGGAMKAKRRGGVRVIVPPAACAAPTRVTCRAATRRAPVAAPPPLMEGEALASRLLELQPQGAKFLAPVIIEVPIFTASCPEREIEALTREKQEHGPGGERLGENGERVTRIITCDFPHYLAVVSRVRQEVHVIGPEGGTVSSAHIPQVQALFPPSALTKRIRVGLQAHGADKALCSRLLPRHAAVSPVLTVEPRRRKFHRTITLTAPLPQPHDHKLNSHDKQSTANLRLLCSIMGGQARAVWEDVTGSTPLTITDDCASFTTTVSARFWLMNCQNVSDATKLATELYREMLLVPFEIRIVVLGKRLDSLEGRLLVLYITDRYAYDTLLHQEHYTEVAHSTSVRLLDGRPVYLEFSGNLVPVTKSGTQPTLVFEAFKDNRVEFPVRVKHHEESPSGRIYFMNEPKVAKGEPSASPTCVLDVSLPERVAPRPAKSHLDCLNLDQSGFDALKDELSFNWEHNHSVGPPTVPIDHATTQLNGHDKSVANGDVKYETKEDDRKDMPRINGDATHDSNKAKATFDSDEDKTPMNTLEKGKKKAEGGSFLGGLADKVKNVFSHSEDKEESKESSPKPQPKPRESKEKSPPKPAPRIINEDLLEKVEDMFKDVRTKPHLKDEDDVNSQVYSTKLIVDEGNPDEVRLKTEEVYHQYENMQPVDTPLYSRKSDPFQFFTGCCEDKYKKGEKHRHDETSTLADRIEQVKFSANEQIQEIEDKYQKDLQKDLKEAEREKDNIKQELSEIKEDIIVKSENIKNNLIDKEDKLKAGFEDKVQEFSDVVNKKKNELSESGEKIQSDIGAKVVDMQTKVSLTLDNANDDVNNLFDTVDNTKQNINNKFVALDLAAMERAEEMKLRGHNKLSQMGKTFENAADDVSHKTEESINEASNKINNLKTDIENNAENVKDKANEKVNNVKVNINDKVSAIKAEAEEIVDEVEEKSKKAVAEVKSKSKDKKNAAKKKAKEGKNFFTGLVHNIFGEKEKIEDEVKGKIAEGKESASKLSEQASAAKEKLEENIAKEQDKLKQNTEQIMQRTENTAEDVVKNVKDAKDDLLESAQEKAAQAVNGLNKKKEEVIQNVEGRANEISTSIQDKKDELLLSAQEKKGEILQNVQDNTELVVTNIKAKKDELVHKKDELIQSAENKKDQLLHSAQEKGEEIFATMESKKNEAVQKKDELLQSAEDKKNDILEMTNKETERIVTAVKEKKNEVLQTIDDKSSEAVQNARAKKDEVISGISDNIDKAKQTVGETKDNLISGIAAKKDEMINLKNTMQDGAENRLNQIKQEAEETTEEIKRTAEETLNAAAEKKIELKSNIALKFNDLKDAGEDNLERLRDAAKDKAEKVQKSAKVTFEDVEVSASSAFDKIESAAMDKAGDVKDIYDDLHSSTRDTFADLKDDAQGYESSAQDTLHSFQSSAGNTFDSLDESAKEFIGDVQSTAARSEESADQLFGNVKDSFEGFQIAAGDKLEDIKVSTKQHFEEGQDNLGKTKDEVVQGLKESEQVVTDTLTSEADKFTSSLNDLGNSVFGMSGKGFMKDSSTKLLESEKSQSSPHKKSSGAVSSETIVT-
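
Consistency check (illustrative sketch, not part of the original record): the stated lengths agree with each other, since 1703 residues plus one stop codon correspond to 3 x 1704 = 5112 bp. The Python below assumes the standard genetic code and uses "-" for the stop codon, as in the protein record above. The helper names translate and expected_cds_length are hypothetical, and only the first and last few codons of DPOGS215447-TA are used, so this is a spot check rather than a full validation.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(n_residues):
    """Length in bp of a CDS encoding n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


# First nine and last five codons copied from the DPOGS215447-TA record.
first_codons = "ATGGGTGTCGTCAAAAATAAAAAGAGC"
last_codons = "ACAATTGTAACGTAG"

print(translate(first_codons))    # MGVVKNKKS
print(translate(last_codons))     # TIVT-
print(expected_cds_length(1703))  # 5112
```

Both fragments match the start (MGVVKNKKS...) and end (...TIVT-) of DPOGS215447-PA. Running translate on the full transcript would be the complete version of the same check.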