Monarch geneset OGS2.0

DPOGS215782
TranscriptDPOGS215782-TA3570 bp
ProteinDPOGS215782-PA1189 aa
Genomic positionDPSCF300041 + 1861495-1865633
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0026190.065.88% 
BombyxBGIBMGA005056-TA2e-4936.67% 
Drosophilaarmi-PA0.032.43% 
EBI UniRef50UniRef50_D6WE130.036.66%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WE13_TRICA
NCBI RefSeqXP_001605981.10.034.78%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|910924420.036.66%PREDICTED: similar to armitage CG11513-PA [Tribolium castaneum]
NCBI nr blastxgi|1984760760.033.68%GA25304 [Drosophila pseudoobscura pseudoobscura]
Group
KEGG pathway 
Orthology groupMCL11796 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215782-TA
ATGTCGTCTTCTAGCAACATATCTCAATCTACATACTCTACATCTGAACCAGAGACTCCCAATAGTACCTCTAAGAGCCAAAGTCAAAATTTTGATCAATCCTTGAACAATGAAGACAGTATTGATCAACCGTTAAAAAATCCCCAAAAAGAATTACACCAAAAGAGAATACAATCTTTAAGAAAAGAACTTGAATATCTGAAAGCAACAGAATGGAAATTTGAATGTGATAAGGGTTTTCAAGATGAAGAGTCTGATGAAGATAAAGAGATGTTTTTAGCTGAAGAACTTTTACAATTAGAGTTTGAAGCCGAAAATGAAGCATCCATGATTAATCACTCGTACCCTGTTGAACCTCCTAAGCTTGCCGCCGGTGCAGTTTGTTTTCAAAAAACTGGTATTATAACAGATTGTGGTGATGACTATGTCCTCATTGATGGAATGTTATATTTTGCAACACAGAACTCACTGAGTTATAATGTCAATGACAAAGTTCTATATCTTGGTTACAAGGATTCAAATGATTCAATAAATGTTGTAAGAATATTGGAAAACCAAGGCTTATTTTGGGGTGACGAGGATGAAGAAGATGTTGAAAACTTCAATACTATTGAGCACATCTTAATAGGTCAAGTAGATTATAGAGAAGAACGGATGGTTTACATTGTGGACAGTGACTTGAAATTCAACTTAGACAATGTTGCCGGAACATTTGTTCCAATTAAAGGTGATTGGTTGGAAATGAAATGTACAGTACAACAAAATGAAAAGAGACCTGTGGATATCAACACAAAACAAGTGTTACAGGTGAAATCCTTTAATGCTATAAGAACCAAAACGAAGACGGCCATAGTTACTCAGTGGTCTGGAAGTGAAGGGGTTTGTGATAGACAAATATATATCAATAATAGTGCATTGGTCAATGGATCACAAATAAATATTGGCACGAAGGTCATGGTAGAAGCAATTGAAAGCAATCAAGGATTGTGCACATGGAGGGCACTGAAATTAATGACTCTCGAAATTGGGTCCGAGAAGAATGTGGCAGAGGAATCAAACGAAGGTCAAATTAGTCTGGCCCTAGAAAAAGAAAAAAAAATTCATATGACATACCCATTAAAGTTTGAGAATGTTAAATTTGATCAAACAGAGAGTATAATATTAAATATAACAAATAAGAGCAACAATATGTACATACTAAATAAGTGGATAGTGCTGAGTAAGAAACGAGATTCACAAGTCTGTATAACGCCATTCATCAATCAACCAATAAAATTATCACCAGAAGAAAATATCAGCTTCACTATAACATGCTCTCCAAAGTTCATGGGATATGCACAGGAGTGCCTCGTTATATTGTTTCGAGGTTTCCAACTGAAGAGACATATCAATATACATGTGTGCAGTGATCACCGACAAGTTAATTTTGATTTAAATGGTGATTGCCATATAATGGAATCAGATAAAGCTGATATGATGAGAAAAATTAGACGCAATACTAATTCATATGTACCAGGTGTGAAACCAATCAAATCACCAGCTTTCGTATCCGTGAAAATTGGTAATTTTCCTATCCCAGACAAAATCTGGGCTGTTGTCTTGGGCGATTCCAAGCAGACCATCTGTAGCAATGATTTCAATAGGGTATTATCCTTCATTGAAAGACAGCTACCTTATTTATCTCAAGATTTGAATATTACAAATTATATTGATAAATGGCACGCCCTTTTGTACATGGAAGAAATACAAGCTAACCTCAATATGCGTGTTTACGACAGGTCAAAGGTATTCTTGGTACATTGTGACGAATATCTTGGCATTGAAATACCAGGGTTGTCAGAAAAGAGACCGTCGCTCATCAAAGGAGACAGGGTCATTGTGAAAGATATTTGGAACGAATCCAATCCGGAATACGAAGGCTATATACATGCAATAAACGGTGATATGGTACTGATGAAATTCAACAGCAGATTTCATGAATATTACAGCGGCAGTGATGTTTCGATTGAGTTCCACTTTAGTAGGGCTGTGTATAGACGATCGCACCATTGCATCAACCAAGCCCTATCAAATTTAGGGCCGGACATCCTATTTCCGTCTCGTGTTATAACTAAAGAATCTCAAGTGTCCAATGACGTTTTGGAAGATATGAAATGGTTTAACCCAACTTTAAATAAGGATCAGAGAAATGCAGTGATTAATATATTGAAAGGCGAATGCCGACCGATGCCTTATATCATCTTCGGACCCCCTGGTACTGGGAAGACTGTAACTGTCATAGAAACTATTTTGCAAATTTTAACCTTAATACCAGACAGTAGGATTTTAGTTGCGACACCGTCAAACAGTGCGTCAAATTTGATAACTGAAAGACTTATAAAATACAAGGACTCGTTCTCAGGATCAGTCGTAAGATTAATCGCTAACTATCTAGTTGATTCTGACACCATACCAGAGGATGTGAAGCCATTTTGTGCCACATTGGATATAGCCAAAGAGAATACAACAAAATCGAAACATTACGTCAAGGATAACATACAACTTAATTGTCAGAAATCTTTAATAGTCAGGCATCGTGTCACTATAGGGACGTGCTATTGTTTAGGATCTTTAAAACATTTAGACATACCTCGAGGTCACTACACTCATATCATTGTGGACGAAGCTGGTCAGGCTACAGAGCCGGAGATAATGTTACCTTTGACCTTCACCAATAAGGAACATGGACAAATTATACTCGCAGGGGATCCTATGCAATTAGGACCTGTCGTTATGTCAAAATATTGTAAGGAGTTTGGACTGGACGTATCGTTCCTGTGCAGACTTCTAGAGTGCTTCCCATACTTGAAGGATTATGAATCTTACGCTTGCGGTTTCGATAAACGTCTCGTCACCAAATTGAATGATAACTATAGGTCGCTGAAGGAAGTTTTAACATTACCGAGTGAAATGTTTTACGATGGGACATTAGTGCCAAATGTAGACAAAAGTATGCCCTGGACAGAGAAATTCATTGATGCGACTTGTCAGATTTTCGGTTCGGATGATAGGAACGGCGGGATATTCGTATATGGTATTAAAGGAACCAACATGCGAGCACAGGACAGCCCGTCCTGGTACAATCCACAAGAAGCGGCGATGGTCGCATTGACGACCTGTAAACTGTTTAAGAAGAACATCACCGAAGAGGAAATTGGCATAATCACACCATATATAGCACAGACAAAATATCTACGTTTGCTTTTCGATTCCATGGGCTTGAATCAACCAAAAATTGGCACTGTTGAAGACTTTCAAGGTCAAGAACGACCGGTAATTTTAATTTCAACCGTTAGATCCAGCGAGTCGCACCTGGAGGAAGATGCCAAACATTATTTAGGTTTTGTTAAAAGCCCGAAAAGACTAAATGTAGCTCTGACCCGGGCACAAGTGTCAGTTATATTATTTTGCAATCCACATCTATTGTCCAAAGACCATTTGTGGAGAAAAGTTATAAGTTACGCCGTCTCTAGTGATAAGTATATGGGCTGTGATTTGCCAACCAGCCTCTTAAACAATTTATCTTTATGA

Protein sequence:

>DPOGS215782-PA
MSSSSNISQSTYSTSEPETPNSTSKSQSQNFDQSLNNEDSIDQPLKNPQKELHQKRIQSLRKELEYLKATEWKFECDKGFQDEESDEDKEMFLAEELLQLEFEAENEASMINHSYPVEPPKLAAGAVCFQKTGIITDCGDDYVLIDGMLYFATQNSLSYNVNDKVLYLGYKDSNDSINVVRILENQGLFWGDEDEEDVENFNTIEHILIGQVDYREERMVYIVDSDLKFNLDNVAGTFVPIKGDWLEMKCTVQQNEKRPVDINTKQVLQVKSFNAIRTKTKTAIVTQWSGSEGVCDRQIYINNSALVNGSQINIGTKVMVEAIESNQGLCTWRALKLMTLEIGSEKNVAEESNEGQISLALEKEKKIHMTYPLKFENVKFDQTESIILNITNKSNNMYILNKWIVLSKKRDSQVCITPFINQPIKLSPEENISFTITCSPKFMGYAQECLVILFRGFQLKRHINIHVCSDHRQVNFDLNGDCHIMESDKADMMRKIRRNTNSYVPGVKPIKSPAFVSVKIGNFPIPDKIWAVVLGDSKQTICSNDFNRVLSFIERQLPYLSQDLNITNYIDKWHALLYMEEIQANLNMRVYDRSKVFLVHCDEYLGIEIPGLSEKRPSLIKGDRVIVKDIWNESNPEYEGYIHAINGDMVLMKFNSRFHEYYSGSDVSIEFHFSRAVYRRSHHCINQALSNLGPDILFPSRVITKESQVSNDVLEDMKWFNPTLNKDQRNAVINILKGECRPMPYIIFGPPGTGKTVTVIETILQILTLIPDSRILVATPSNSASNLITERLIKYKDSFSGSVVRLIANYLVDSDTIPEDVKPFCATLDIAKENTTKSKHYVKDNIQLNCQKSLIVRHRVTIGTCYCLGSLKHLDIPRGHYTHIIVDEAGQATEPEIMLPLTFTNKEHGQIILAGDPMQLGPVVMSKYCKEFGLDVSFLCRLLECFPYLKDYESYACGFDKRLVTKLNDNYRSLKEVLTLPSEMFYDGTLVPNVDKSMPWTEKFIDATCQIFGSDDRNGGIFVYGIKGTNMRAQDSPSWYNPQEAAMVALTTCKLFKKNITEEEIGIITPYIAQTKYLRLLFDSMGLNQPKIGTVEDFQGQERPVILISTVRSSESHLEEDAKHYLGFVKSPKRLNVALTRAQVSVILFCNPHLLSKDHLWRKVISYAVSSDKYMGCDLPTSLLNNLSL-