Monarch geneset OGS2.0

DPOGS208080
TranscriptDPOGS208080-TA3702 bp
ProteinDPOGS208080-PA1233 aa
Genomic positionDPSCF300282 + 83109-88242
RNAseq coverage105x (Rank: top 60%)
Annotation
HeliconiusHMEL0033480.056.59% 
BombyxBGIBMGA007740-TA3e-16760.03% 
DrosophilaCG3493-PA2e-5548.47% 
EBI UniRef50UniRef50_E2ACD02e-6841.49%Golgin subfamily A member 4 n=2 Tax=Camponotus floridanus RepID=E2ACD0_CAMFO
NCBI RefSeqXP_002040070.13e-6243.37%GM15554 [Drosophila sechellia]
NCBI nr blastpgi|3838477634e-7343.39%PREDICTED: golgin subfamily A member 4-like [Megachile rotundata]
NCBI nr blastxgi|3454836831e-11029.86%PREDICTED: hypothetical protein LOC100116796 [Nasonia vitripennis]
Group
Gene OntologyGO:00055154.1e-20protein binding
KEGG pathway 
InterPro domain[1172-1231] IPR0002374.1e-20GRIP
Orthology groupMCL18331 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208080-TA
ATGTTTAAGAAATTTAAAGATAAATTGGCTGAAGAAGTCAAATCATCGCCTCAAAGAATTCAACAATTTGCGCAAGCTGCTCAGGCAGCAGTGACTTCTGCCTCAAGCAGTATATCTGATATTACAAACAATGATTTGTTCTCTATTGGAGATAATGATGCCTCAAATAAAAATCGTATTCCAAGTAACAGTCAACAAGGATCGTTTCATGAAGTACCTCTAAGCCACACAGGAGCAATACAGCCTAATATTCTTTTAGATTACTCACCCAATCATGAGGGTAATATGGACAGTTCAAGACAGAGGAGGTTGTCAAACAGTTCATTTGCAAGTGACATCTCCTTCAGATTACCCAGTTATGAGAGTCCATCTATGTATCATCTGCAGTCGGATATGGAAGTGTCAGCTAGTGAGGCTGAAGATAAAGGATTTCCTGGTGAAACTGTCAACCTTGATAGGGTAACGAAGGAGCAGTTGTATTCCGCCTACCGGAGGACACAAGATCGATGTACTAAATATAAAACACAGTACTCGGATCTTGCACGACATTATAAACTTCTGGAAAGAGAAAACGCCAAAGCAAGGAATGTCTTAGTAGAGACACAGGATAAAGCTTTGCGACGAATATCAGAACTAAGAGAACAATGTTCTCTAGAACAAAGTGCTAAGGCCCACCTTGAAAAGGCCCTCCGAATAGAGATTGAAGAGAAAAACATGAAAATAGATACTCTAAATACACAACTCAATTATCTAAAAAGCAATAATAATCCTAACGAAGAACAAGTTATACAAAGTAATGCAGAAAAAATAAAGGAGAAAAGTGATAATGAAGCTTTGTTGATAAATTTATCCACGGATAGCAATGAAAACAAAGCCACTGACGAACCTGTAGCCTCCGAAGTGACAGTATTAAACAATAAAATTGAGAAAATGGAACAATTGATAAGTAAATATAAAGATTCATTGAAAACTACTAAGGAGAAGAATGCTCAGTTGACAACAGAATTACAAATAATTTCAACTGAACTAGAGTCAAAAGTGAAGGAAAATGATCAATTAAAAGCTGCAACCGTATCACTAACTGAAGCTAGGAAGAAAATACAAGAATTGAATGAAAAAATTGAAGACCTAGAAAATAAAAACAACACATACGAATTTTCAAAGCAAAAAGAAATATCGATACTCGAATTGAATTTCAAAAATGCTCAAGAAGAAATTCAAAAATTACAAGAAAATGTTCAAATACTAAGTAAGCGAGAAGAAGAGTATGCTATCTCATTGGCAGAAAACAAATTAAGCATTCACAAGGAATTAGAAGACAAAGAGAAAGAAATTAAGTCCTTAAAGGACAGTTTGGAAACTAGCCAAAAAGAAATTCAATCACTCAATATTTTGAGCAATGAGTTTAAAAATAATATTAAACAGTTAGAAGAAGAACGGCTAAAGTTAAAGGATGAAATTAATATTTTAAATGTTGATAAAACAAAAGTAGAGGGATTCAAATTAGATTTAGAAAATCTTGCACAAAAATGTCAAGCTCTTGAAAATGTAAATAGTAAGTCTGAGGAAGAATACAAGTGTCTGCAACTGCAATTGAAACAAGAAACGGCTGAAAAATTAGCAATGATTGATAGGAATACGTACTTAGAAAATAGAAATAAGTTATTGCTTGAAGAAAATACAAAAAAAAGTACTAAAATAAAAGAATTAGAAAAGGAAATACAAACGCTCGAGAGCAATCTGGAAACCAGCTTGGAATCTGAGATTCAAGAAGAGAGAGTTGAGTTGATAAAACTGCAATCTGAAATAGAAAAGCTTTTGACTAATTACGAATTAGTTCAGAATCATAACATTGAATTAAAGAGCAAAATAGAGGAGTTGGTTTCAGAAAATGTGTTATTGAACCAGAAGGCGGTCAAAATTGAACAAATGAAAGAAAATTGTAAAGATTTGGAAAAGAAAGTATCTCATTTACGTAAACTTATAAATATGAGCTTAGATGAATCAAACATGTTGAAAAAGTTTATAAACAAACATTTCCAAATATTAAAAGATAAAATGATGTTGTTAGATGAGGCTTTAAAGAATGATACCGCCAGTTCAAAAATGTCGGATGATTTAAAAGAAACCAATCAATCCTTACTTCTTCAGATAAAGACTATTTCAGATCAATTTAAATCCGTTAGCGATGAATTAAAACAAAGAGTGACTGAAAATGACAACCTTAAAGTTAAATTAAATCAAGCTGAATCAGATTTAAACTTACATGTACAAAATATAGATATTTTAAAAAATGAAAACCAAACAGTTCTTAATAAGTTGAGTGTATTGATAAAAGAATCTGAGAGTTTTAACGATAACTTGATGAAATGTAGAGAGGAAACAGCAGCGACAATCAAAGAACGAGATCAAGCCAAAGAAGAAAAAGAAGAAGTGAACAAAAAACTTTCTCATCATATAAAAGAAAATGAGTCACTAAAAGACCAAGTTAATAAAATCAATAATCTATATAAGAATGCTGTTGAAAAACTAGATGCTTTGAATAAAGAAATTGATGATTTACGTGATATGAAATTAAATTACAAAGAGAAATGTAACAGTACAGTTACATTACAGAAACGTATTGATGAACTTACTCAAGAAAATGCTTCATTGAAAAAAATGTCTGAAACAATTGCCAGTGCTTTAGAAGGAATTGAGTCTGAATTGAAAGATGTACGTAATTCTCACACCGAAATTGAAATGGAAAAAGACCATTTAAATTCTATTATAGAGAAACTAGAAAAGAAGGCAGATGTATTAAATCATGACACCCAAACAGACCATTGTAATGATAAAGAATTACTTGAACAAGAAATCAAAGATTTAAAAGAAGCCAATGAGATATTAAGTCATGAAAATGAACAACAAAAGATAACAATCAGTAACACAGATGATATTTTAAGTAAACTGAATGATGTGATGAACAATTACGATGTATTAAGAGAAGAAAAAAGGCGTTTACAGTCTGATATAGAAGGCCTACAAACACATTTGACTAAGGTTTCTAAGGAGAATAGCAATTTAAACGACAGATTGCGTGAGTTGATCGCCAGCAGCGACAATGTTAATGATAAAAGTGAGAAATCCTCGTATGAATTGCAATGCTTGATGGACGAAGTGAAAGCTGGACAAGAAAAAATTGAGAATCTTATTAGAGAAAATACTTTACTTGCAGAAGAAAATTTAGAACTTAAAGATCAAATAAATACACAAACAACCGATAAAACATCTGTAATGAATGATAGCAATAAGTATATTGGCAGTGATAATGTAATGGAAAAGATTAATGACTTATTGGACACAAAGAAAACATTAGAAAAAGAAGTTACAGATTTAAAATTAATAAATCAATCAGTTAGCGGGAACATGCAACAAGTTCAAGCAAATAATGAAAAGTTAAGATTATCCAATGACAAACTGGAAAGAAGATTAGATGAGGCTTTAGTTAGTTTAAGGCATTTGCATTCTCTGCAAGAGAATACAGAACTTGAGTATCTTAAAAACATCCTATATGAGTACCTCACAGGATCTGGGACACATTCCATAACACTCGCCAAGGTTTTGGCCGCTGTAGTCAAATTTGATGATCGACAGACAGAAGCAGTTTTGCAAAAAGAAAAAGAAAGACAAGGCTTTTTGCGGCAACTTGGCATTATTTGA

Protein sequence:

>DPOGS208080-PA
MFKKFKDKLAEEVKSSPQRIQQFAQAAQAAVTSASSSISDITNNDLFSIGDNDASNKNRIPSNSQQGSFHEVPLSHTGAIQPNILLDYSPNHEGNMDSSRQRRLSNSSFASDISFRLPSYESPSMYHLQSDMEVSASEAEDKGFPGETVNLDRVTKEQLYSAYRRTQDRCTKYKTQYSDLARHYKLLERENAKARNVLVETQDKALRRISELREQCSLEQSAKAHLEKALRIEIEEKNMKIDTLNTQLNYLKSNNNPNEEQVIQSNAEKIKEKSDNEALLINLSTDSNENKATDEPVASEVTVLNNKIEKMEQLISKYKDSLKTTKEKNAQLTTELQIISTELESKVKENDQLKAATVSLTEARKKIQELNEKIEDLENKNNTYEFSKQKEISILELNFKNAQEEIQKLQENVQILSKREEEYAISLAENKLSIHKELEDKEKEIKSLKDSLETSQKEIQSLNILSNEFKNNIKQLEEERLKLKDEINILNVDKTKVEGFKLDLENLAQKCQALENVNSKSEEEYKCLQLQLKQETAEKLAMIDRNTYLENRNKLLLEENTKKSTKIKELEKEIQTLESNLETSLESEIQEERVELIKLQSEIEKLLTNYELVQNHNIELKSKIEELVSENVLLNQKAVKIEQMKENCKDLEKKVSHLRKLINMSLDESNMLKKFINKHFQILKDKMMLLDEALKNDTASSKMSDDLKETNQSLLLQIKTISDQFKSVSDELKQRVTENDNLKVKLNQAESDLNLHVQNIDILKNENQTVLNKLSVLIKESESFNDNLMKCREETAATIKERDQAKEEKEEVNKKLSHHIKENESLKDQVNKINNLYKNAVEKLDALNKEIDDLRDMKLNYKEKCNSTVTLQKRIDELTQENASLKKMSETIASALEGIESELKDVRNSHTEIEMEKDHLNSIIEKLEKKADVLNHDTQTDHCNDKELLEQEIKDLKEANEILSHENEQQKITISNTDDILSKLNDVMNNYDVLREEKRRLQSDIEGLQTHLTKVSKENSNLNDRLRELIASSDNVNDKSEKSSYELQCLMDEVKAGQEKIENLIRENTLLAEENLELKDQINTQTTDKTSVMNDSNKYIGSDNVMEKINDLLDTKKTLEKEVTDLKLINQSVSGNMQQVQANNEKLRLSNDKLERRLDEALVSLRHLHSLQENTELEYLKNILYEYLTGSGTHSITLAKVLAAVVKFDDRQTEAVLQKEKERQGFLRQLGII-