Monarch geneset OGS2.0

DPOGS212130
Transcript: DPOGS212130-TA, 3339 bp
Protein: DPOGS212130-PA, 1112 aa
Genomic position: DPSCF300038 - 198-7346
RNAseq coverage: 24x (Rank: top 78%)
Annotation
Heliconius: HMEL003823, E-value 0.0, 69.49%
Bombyx: BGIBMGA006759-TA, E-value 8e-123, 55.69%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_E2B4E8, E-value 2e-56, 22.35%, Coiled-coil domain-containing protein 108 n=2 Tax=Formicidae RepID=E2B4E8_HARSA
NCBI RefSeq: XP_002738067.1, E-value 1e-27, 20.76%, PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
NCBI nr blastp: gi|307214368, E-value 9e-56, 22.35%, Coiled-coil domain-containing protein 108 [Harpegnathos saltator]
NCBI nr blastx: gi|307214368, E-value 4e-58, 22.51%, Coiled-coil domain-containing protein 108 [Harpegnathos saltator]
Group
KEGG pathway: (none listed)
Orthology group: MCL22139 Insect specific

Nucleotide sequence:

>DPOGS212130-TA
ATGGAAAATTGTTTGTCAGAAAGTATTTCTGATATTAAAATTGTTGATTTCAATTCCATTCCTACTGACACTGTATCGGAAAGGATAATTACAATTAAAAATGTTACAAATGACAATTTAACTTATAGGGTGTCTTTGCTTCATGTTTTAAATAACATAGACCAAGTATTTAAGTTGTCAGTTCCCCCCGATCCAATAAAATCACTTGAATCGGTTGAGGTTAGAATCATTTTCAAACCTAAAGTACCAGGACAATCTTACACTGAACATTATCTCATAGATGATACGGCTGGCAATGTATATAGATTAACTGTCAAGGGCAAATGTTATGGTACAGAAGTGACGATTAATAAAAGAAAACTTGAATATAGGATATGTAGAGGAGTATCCGAAAACAGAAAGGAACTTATAAAAATCGTAAACAAATCCTCCCTCGACGCAACTTATCAATGGCTGTTACCTGTAGGCGGGCAAGGATTTTTTCAAATTTCAACTGGTAACTGCGGTGTTATTCGTGCATTTGAAGTTATTACTACTGCAATTATATTCACAGGTTCAGCTCTAGGAGTGTATACAGCTGAACTTGTATTTTTGGTCTTGAATCAGGAGCCGTTATTTTTAAATATAATAGCCTCAGTTGTTTTATCTGGTAACCCTCTATACGGAATACAGGAACATGTATTCGAAAAAAGACGTAAAAGCCGATCAGTACATTTAATGGAAAACAGTTTAAACTTTTTGAGTTACATACCGTCGGCGAGTGTTTTTGAAAAATATCTTGATTTTGGAAGCGGAAGCGTTAGTGACATTACTTTAAATATATCACAGACCTTGTGTGTTACCAATCATGAAACAGAAGAAGGTTGCATACAATGGATTCCAGACCCTGACAACGTATTTTTGATCGATCCGATAGCGTCGATCATTCCATCAAATGAATCGAGACTGTTTACGATACGTTTCAGGCCAAAAATCGAGAACGAAGCCTACGGTTATCTGCTTTGTGGTGATTTTCAATATAAAATATACGATGAAGAACAGTTACAAAATGTTAAAATGAAGCATAAATGGGTTAGAATTCCGTGCATTGGTAACACGTGGCCGCCGTGTACGGAGTGGAATACTGAATGGGACTGTCCCATAGAGGTTGTTATGCCCCCAACCGTCCCAACAAGAACAACCTTTGCAAACTTTTTTCTATCAAATAAACTCGGAATACCTTTGACCTATAAATTGGAGGCTCCTGAAAAAACGAATTTTGTGGCCTTGCCCATGTGCGGAATTGTACCGGGGCGTGGATGGCAAATAATAACAGTTGCTTTAGAACCGAAGTCCTTTGGAGAATATTGTGAAAATTGGGATCTCATCATTAACAAAATACATAAAGCTCGTATAAGCTTCATAGGTAATGCAGAAGTCAGCCACATTGAGATGATGTCTCACGGGTACAATCCTGACGCTCACGCCATGTACGAGTTCCCGCCAACCGTCACCGGGTGCACCAATTATTGTACCGCGTACCTTCACAACCTCACCAGAATGGATATACACATGAGAGTTCTTTCGAATGCCTCCTGGTTGGGTGCGGACAATTGCGGTTCCATAGTACTACCTCCTAAGGAAATATTTCACTACCACTGGTGGTTCTTCCCGAGAGAACCTGATAAAGTTTACGAAACAACTATCACGTGTTCCTGTATTTGTTTAATTAACGGGAAACCAGTTGGTGAGCCCACGGAAATATTTATACATATCATGGGTTTCTCTGAACTGCCAGATCTGAGGGTTTTACCGAAGTCGTACAATTTTCATGATGTAGTGGTGGGGGAGAGCAACACTTTTAGCGTCACTCTATACAATTATGGTTCTTGTTATTTCACATCTAAACTGTACCGTGTTATTAACGGCATGGGGGATGACTACAGCCGGGATAAGTTTGAAATTGATTCGAATATTAACAGCCTCAAACCTTCCAACCATTGCGAAGTGAAATTTACCGT
GATCGCAGACGGAGCCGGTTCGAGACAAGTAGATATTAAATACACCGTTTTATATAGAACTGAATTTGACGAGGTAGAAGAAATACAGCCCATACAGAAAACTATATGTATTATTTGGTACGATGGGATATACCCGACGATTAAGGTCAAACGTACAATATCGGTAAAATGTCCGGTGATTTTGAGCAACCATTGTGTTTGGAATATGGTGAACGTGGAAGAATTGAATAAGGCTTTAGAGGATTGTCGTCCAAACAGACCGATGAACGTTAATATTTACGCACCGGAGCTCTGTGCGAGACCTCAGGATGTGGAAATCATCTTTGTTTTGGGATGTGTGTATTCAGTGAGCGTGCCGTTCAATTTGAGGCGAGAGAAAATATGTGACTGTGACATGGTTGAGGTTCAAGTCGGCATATCCACTTATGAGATGCGTCACACCTGCATACATAGACCCCTGGTTGAAATATCGCCACTGAATGGAATTGTTACGCCGGAAAAGCCAATACTTTTGAACATTCGTTTCAAATTTACGTATGAGGGAAGCATCCTCACTGCTCTGCGAAAGCCCACCAACGATAGCGATTATCTGACAGTGTTGGATTGCGGCAGAGTTCCTATTAATAATCTCGATCCAGTTATAAGGATCATGTGGTTTTACAATCCAACCGAGGTCTTTACAACATGGCGTCTGTTGAGAGGAAACACTGTGGCATCCACCACAATTATCCGTTGCCTTCAATACTTCGCTGAAGTGCCACCGATGGACAAACTGGCGATACCATTCGCCTTTATGCCCAAAGAAATGATTGACTACGAGATAACGTCCTTTGGATACGACATCGTAAAAATTCTGATCAAAGGTCAAGGGGGTCTTCCCAATTGTCTCGAGACTCGTCTTGATATACCTGTGTATGTTGACAGAATTATAAGATCTGCTTACAGACAGAACGTTGTTTATTTATCAAAAGAACACATCACGCTGCCAATAATGACAACCCATTCGTTGTTGAGGGATATATTAGCCGTTGTTAATGACACGGAGAATGTTATAAGGTTTATCTGGCTGCCGGAAAGAATGGCGAATATTGTGAACGTAGTCATGACGCCATGGTGGGGTGTCATTCAGCCAAAGAGCACGGAGTGGATCACAATGACAGTCTACACGTTACAGGAACCTGCAACGTTCACAACCACTGTGACGTGTGAGATTCTGGATCTAACTGTCCGAAGAAACTATCAGAGGAACCAAATGTTGAGGAGAAACAAGGTGGAGAAGTGTGCTCAAGAATTTATAATCACTGAGCAAGGGACTATTCATCCCGTTAGTAGTATTTAA

Protein sequence:

>DPOGS212130-PA
MENCLSESISDIKIVDFNSIPTDTVSERIITIKNVTNDNLTYRVSLLHVLNNIDQVFKLSVPPDPIKSLESVEVRIIFKPKVPGQSYTEHYLIDDTAGNVYRLTVKGKCYGTEVTINKRKLEYRICRGVSENRKELIKIVNKSSLDATYQWLLPVGGQGFFQISTGNCGVIRAFEVITTAIIFTGSALGVYTAELVFLVLNQEPLFLNIIASVVLSGNPLYGIQEHVFEKRRKSRSVHLMENSLNFLSYIPSASVFEKYLDFGSGSVSDITLNISQTLCVTNHETEEGCIQWIPDPDNVFLIDPIASIIPSNESRLFTIRFRPKIENEAYGYLLCGDFQYKIYDEEQLQNVKMKHKWVRIPCIGNTWPPCTEWNTEWDCPIEVVMPPTVPTRTTFANFFLSNKLGIPLTYKLEAPEKTNFVALPMCGIVPGRGWQIITVALEPKSFGEYCENWDLIINKIHKARISFIGNAEVSHIEMMSHGYNPDAHAMYEFPPTVTGCTNYCTAYLHNLTRMDIHMRVLSNASWLGADNCGSIVLPPKEIFHYHWWFFPREPDKVYETTITCSCICLINGKPVGEPTEIFIHIMGFSELPDLRVLPKSYNFHDVVVGESNTFSVTLYNYGSCYFTSKLYRVINGMGDDYSRDKFEIDSNINSLKPSNHCEVKFTVIADGAGSRQVDIKYTVLYRTEFDEVEEIQPIQKTICIIWYDGIYPTIKVKRTISVKCPVILSNHCVWNMVNVEELNKALEDCRPNRPMNVNIYAPELCARPQDVEIIFVLGCVYSVSVPFNLRREKICDCDMVEVQVGISTYEMRHTCIHRPLVEISPLNGIVTPEKPILLNIRFKFTYEGSILTALRKPTNDSDYLTVLDCGRVPINNLDPVIRIMWFYNPTEVFTTWRLLRGNTVASTTIIRCLQYFAEVPPMDKLAIPFAFMPKEMIDYEITSFGYDIVKILIKGQGGLPNCLETRLDIPVYVDRIIRSAYRQNVVYLSKEHITLPIMTTHSLLRDILAVVNDTENVIRFIWLPERMANIVNVVMTPWWGVIQPKSTEWITMTVYTLQEPATFTTTVTCEILDLTVRRNYQRNQMLRRNKVEKCAQEFIITEQGTIHPVSSI-
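
The transcript and protein lengths listed in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the original record: it assumes the transcript is a complete coding sequence translated with the standard genetic code, and the helper names `translate` and `expected_protein_length` are made up for this example. It relates the 3339 bp transcript to the 1112 aa protein and translates the first and last few codons of DPOGS212130-TA.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
# Assumption: the record uses the standard code (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# 3339 bp = 1113 codons = 1112 amino acids plus one stop codon.
print(expected_protein_length(3339))

# First six codons and last four codons of DPOGS212130-TA, copied from the record.
print(translate("ATGGAAAATTGTTTGTCA"))
print(translate("AGTAGTATTTAA"))
```

The translated ends agree with the start (MENCLS) and end (SSI, followed by the stop shown as "-") of DPOGS212130-PA above.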