Monarch geneset OGS2.0

DPOGS214201
Transcript: DPOGS214201-TA (2100 bp)
Protein: DPOGS214201-PA (699 aa)
Genomic position: DPSCF300014 + 176527-178875
RNAseq coverage: 246x (Rank: top 42%)
Annotation
Heliconius       HMEL015075         1e-173   52.77%
Bombyx           BGIBMGA006225-TA   2e-78    50.14%
Drosophila       GM130-PA           4e-22    46.08%
EBI UniRef50     UniRef50_Q9GQI6    8e-148   47.46%   Golgin-80 n=1 Tax=Manduca sexta RepID=Q9GQI6_MANSE
NCBI RefSeq      XP_966998.2        2e-46    30.85%   PREDICTED: similar to GA20259-PA [Tribolium castaneum]
NCBI nr blastp   gi|11527033        3e-147   47.46%   golgin-80 [Manduca sexta]
NCBI nr blastx   gi|11527033        2e-159   46.66%   golgin-80 [Manduca sexta]
Group
KEGG pathway:
Orthology group: MCL25219, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214201-TA
ATGGATATAAGAGCCGAAAAACTAGCAAATGCTCGGAAAAAGTTAAGAGATCATCAAGTAAAGAAAACTGTAAACTTAGTTCAGCCTTCTGAAAAACAACAACAGACAACAGAAAAACCAACACCCGAAATTGAGAATCCTCAAAAGGCAAATTACTATGAAATTCCTGAAGGACAATTTAGTGCAGAAGGTATACAACCTAACATTGGAGCCAGTACACAAAACAAAAGTATCGATAACTTACCGGTTCAAAATCCGGAGGTAAATGTAACCGAAATTCTTATCGCTAATAATGCCAATCTTGAGATGCAAATAAAAAATTTTACTGAGAAGTTAGCACATTTAGAATATTTGTATGGCTCAGAAGTGTCTAGTCACAATACTTCTAAACAAAGGATTATTAATCTAGAGAGTGAACTCTCCGATTTATCAAGCAGATATTTAAAAGTTGAACAAGATATGCAACAGAAAAATAGGGAATTTAATGATTTACAAGTCTTGTATAATTCTTTTGTGGAAGCAAATAACAATTTATCAGAACAAGTTGAATTTACGAAGTCAATGCTTACAGCAAAGGAATCAGAAAACACTCACTTACAGAGTCAAATAAATAATTATCATAATCAGTTAGACTCTGCTATGCTGCAGATACAACAGCTGACAAATAACAGTAGTATTATGCAGGCCCCTACAAACAATTCTGTGGAAGAAATTGAATTATTAAAAAAAAAGATATCTGCTTTAGAGCAACAAACAGTTGTCCTACAAAAAGAGAGAGATAATAAGAATTCCCATTATGAACATTATGTAAAAGAACTAACTGAACAACTAAAAAATGAAGTAAATAAAAATGAGAGTCACTTACAAACAATTCAGAACCTCTACAATAGGGAAAATAGTCTCATAGAGCAGATAAGTGATATGGAAATAAGACTACAAAATTACCAGAAGAGAATTGAAAATGAAGTTATAAAAGTAGAACATGTTGATAATAGTCAGGAATTACAGGATAAATATAATGATATCAAGAAACAGCTCGAAGATACTAAACTGAAACTTAGCAATCTTCAGGAGGAATATGCAAAAAGTATTGAAACAATTCAAGAATTATCGGCTTCTAAAGAAGTAGTTTGTGATCATGACAACATTAGTATATCAAAATTGAATGCTGATATAGCAAGTGATAAACTGGCAGCTCAGAGAGCAACAGAACAAAACAGAAAGTTAAAACAGGATGTTGAGAATTTAGAACAGGTTGTAGTTAAAATTAATAAAGATAAGTTAGAGTTAACTGAAAAATTAACACATGAAAAGCAATTAAACAAAGATATTGTTTTAAAATTGGCTGAAGTTGAAGAAAATGCTAAAAACATGACAAAATTATTAAAAGCAAAGGATAGCGAAATGATAAGGCTGCAAAATAATAACAGAGAAATTGAAAAGAAATACGAAGACATTTTGCAAGATATGAATCATATGAAACATGTCACAAGTGTAGAACACAACCATGAAGCAGTCGGCGAGATTTCCAATAATAACTGTTACAATGAAGAGACATCACTAATTCCAGCTCATAACTGTATGGAAACTAGTAACGAAATAGCAGAATTAGACATAAAACATGAACAAAAATTTATACCTAAAGAAGATGCAATGGTTAAGCTTCAAGAACGCTTTCTTAATATAATGGATGAAGTAGCAAATTTATCTGATGAGAAACATAGGCTAGAACACATAATTCTCCAACTACAAAATGAAACTGAGACTATATGTGAATATGTGGCACTTTATCAGCAGCAGAGGAGTTTATTAAAGAAACGTGAAGAAGAAAGAAGCAACCAGCTGAAAATATTCCAAAGCGAATGTGATCTGCTCAAATCTCATTTGGAAGCTCTTCGTGAATTATTATTACGATTAGCAGCGGATGAAGAATTGGATTCATATTTGAAAGACGAAGCAAGATTTAACGATATAATTAGGGTTAAAGACTTACTTGAAAA
GTTGCAAAACTGTTCATTAATAAATCCCAAGTACAATACATTAGATCTTAATATTTTTTACCCTTGTAATTGTTGCTCTGGACAACTTATTACCATTTAG

Protein sequence:

>DPOGS214201-PA
MDIRAEKLANARKKLRDHQVKKTVNLVQPSEKQQQTTEKPTPEIENPQKANYYEIPEGQFSAEGIQPNIGASTQNKSIDNLPVQNPEVNVTEILIANNANLEMQIKNFTEKLAHLEYLYGSEVSSHNTSKQRIINLESELSDLSSRYLKVEQDMQQKNREFNDLQVLYNSFVEANNNLSEQVEFTKSMLTAKESENTHLQSQINNYHNQLDSAMLQIQQLTNNSSIMQAPTNNSVEEIELLKKKISALEQQTVVLQKERDNKNSHYEHYVKELTEQLKNEVNKNESHLQTIQNLYNRENSLIEQISDMEIRLQNYQKRIENEVIKVEHVDNSQELQDKYNDIKKQLEDTKLKLSNLQEEYAKSIETIQELSASKEVVCDHDNISISKLNADIASDKLAAQRATEQNRKLKQDVENLEQVVVKINKDKLELTEKLTHEKQLNKDIVLKLAEVEENAKNMTKLLKAKDSEMIRLQNNNREIEKKYEDILQDMNHMKHVTSVEHNHEAVGEISNNNCYNEETSLIPAHNCMETSNEIAELDIKHEQKFIPKEDAMVKLQERFLNIMDEVANLSDEKHRLEHIILQLQNETETICEYVALYQQQRSLLKKREEERSNQLKIFQSECDLLKSHLEALRELLLRLAADEELDSYLKDEARFNDIIRVKDLLEKLQNCSLINPKYNTLDLNIFYPCNCCSGQLITI-
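
The record lists a 2100 bp transcript and a 699 aa protein, which fits a coding sequence of 699 codons plus one stop codon (3 x 700 = 2100). The sketch below is one way to check that the two FASTA entries agree. It assumes the standard genetic code and that the transcript is coding sequence only, neither of which the record states. The names `translate`, `CODON_TABLE` and `stop_char` are illustrative and not part of any MonarchBase tooling. Stop codons are written as "-" to match the trailing character of the protein entry above.

```python
# Standard genetic code (an assumption; the record does not name a table).
# Codons are enumerated with bases in T, C, A, G order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_char="-"):
    """Translate an in-frame coding sequence, writing stop codons as stop_char."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        # Raises KeyError for codons containing anything other than A, C, G, T.
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_char if amino_acid == "*" else amino_acid)
    return "".join(protein)


# First 30 nucleotides of DPOGS214201-TA.
print(translate("ATGGATATAAGAGCCGAAAAACTAGCAAAT"))  # MDIRAEKLAN
```

Passing the full DPOGS214201-TA sequence, with its line break removed, should give a string to compare directly against the DPOGS214201-PA entry.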