Monarch geneset OGS2.0

DPOGS214243
TranscriptDPOGS214243-TA1797 bp
ProteinDPOGS214243-PA598 aa
Genomic positionDPSCF300014 + 1184009-1190992
RNAseq coverage302x (Rank: top 37%)
Annotation
HeliconiusHMEL0128333e-17257.82% 
BombyxBGIBMGA005964-TA8e-11775.00% 
Drosophilacbs-PA2e-3428.36% 
EBI UniRef50UniRef50_E2B5F61e-5934.84%Golgin subfamily A member 1 n=8 Tax=Formicidae RepID=E2B5F6_HARSA
NCBI RefSeqXP_971801.13e-5132.95%PREDICTED: similar to omega-crystallin, putative [Tribolium castaneum]
NCBI nr blastpgi|3072137685e-5934.84%Golgin subfamily A member 1 [Harpegnathos saltator]
NCBI nr blastxgi|3072137681e-7032.44%Golgin subfamily A member 1 [Harpegnathos saltator]
Group
Gene OntologyGO:00055158.8e-11protein binding
KEGG pathway 
InterPro domain[541-577] IPR0002378.8e-11GRIP
Orthology groupMCL15022 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214243-TA
ATGTTTGCAAGTTTAAAAAGTAAAATCAAAGAAGAAACTGGCAGTGATATAAGTAAATTAACAAGTAGTTGGCGAAGTGGCGCTTTTTTAGGCAGAATAACTCTGAGAGATGATTCCTCTATATCGCCACAATTGCATACATACTTGTCCGATCGGACAGGAGCTACAGTGGACCACGCAGCTTTACATCAACAATATACAGAGCAGTTGGAAGCACAGCTGAGAGAAAAGGACTTGTATTGGGAACAGAAGATCGAGGAAATGAAGTTATCATTGGCAACTGTACAAGGCGAAGAAGCCCTGGCAGCACAAGCAGTAGCTCGCACAGCTCAGGCCGAAGCAGTGAAAGCTATAAGTGGTAGAGATGCCATGGAGAAACAACTAGCTGAATTCAGAACGCGACTTGCGGCAGCTGAGAGCGCACAGGACAAACTGGATGCACTCTCAGAAGAAGTGGAGCAAAGTCGTCGTGAATGGTCACGGACGCGTGCGGAACTGACCGCGGCGCTTAGTGCTGCAGATGCACGGGCACGAGACCTAGCCGACCAGCTGGCGGTCGTAAAGGCTGCCTTGCCTGTGGGGCATGAAATGCATCATATGCACGACGACGATAAACGCGAATTTAAACCTACCGGCTGCGAGGATTACGACCGAGTTTGCCGCGAGAGAGCGGTGTTGACGCGACAACTACAGGAAGCCAAGATGGCGCTCGCAGATGTCAAGACGTCATGGAGTGGGCAGATCGCCGCGTTAGAGACACAGGTGGCAAGAGTATCAAGACAGGCGGGAGAAGAGGGTGCGGAGAGACGGAGAATTGAAGTAGAGAAAAAGGAATTACGTGAGAAACTTGATAGTATGGCAGCTGAACTAGAAAAAACTAAACAGAATCTCGCCAACAGTGAAGCTAAGGTGAGCCGGCTGAGTGCAGAGGTCCAAAGTCAGGCACGGGAACTGAAGACACTGAGGGCCGCCGACATAACAACTGAGTTAGAAAATCGAATAAGAGCTCTTACAGAAGAAAACAGGGAATTGTTGGAAGCTTTGGAGACTGAGCGGGGGCTGGTGAAGGTTCTAGAAGCTAATCTCGAAGAATCGAACAGAATATGTGATGAGAGCAAGGAAGAGGTGGCCCATCTTAACACTGCCATGTCAGACATGCAACACGACTATATGGACTTACAGAGGAAATATGAAAAAGAGAGACGAGAAAAGGACGAGGCGTTATTAAGAAACGCTCACATGTCGCAAAACATTGAGATGAGTCAATGTAACGTGCGGTACCTAGAAAATGAAGTAGTCGACCTAAAGGCCAAGATTACTGAGGTCGAAAGCATTGTTGCGCAACACAGAGAGAACCAAGCCGAGTGTTTGATAATAAAAGAAAACGAAAAAGCGAGAACCAAAGAAATCGAAGATCTTCAGAGGAGAATTGAGGCTTTCACGGAAAGCGAAAATTCTCTTAAGAAAAGCATACAAGACTTAGAAACCGAAATCTGTGATAAAAATAAGAAAATTAAGACTTTAGATAACCGAATATCGGACATGAAAAAAACCCTACAGAGGGAGTTACAAAGCAGTAAGACGGATCTGACATCAGCTGAAGAACAGGACATTAGCAGACGATACTTGAAGCACGTTGTACTTCGATTCCTAACATCCCGTGAGTTGGAGGCGCGACAGTTGACTCGAGCGTTGTCTACGTTACTAAGGCTGTCTGCTCATGAGGAGGCTTTGCTCCGGGCAGCTCTGCCCGCTCGTACTGGCCTCGCCGCCTGGTTCCCATCGCTCAACGCATGA

Protein sequence:

>DPOGS214243-PA
MFASLKSKIKEETGSDISKLTSSWRSGAFLGRITLRDDSSISPQLHTYLSDRTGATVDHAALHQQYTEQLEAQLREKDLYWEQKIEEMKLSLATVQGEEALAAQAVARTAQAEAVKAISGRDAMEKQLAEFRTRLAAAESAQDKLDALSEEVEQSRREWSRTRAELTAALSAADARARDLADQLAVVKAALPVGHEMHHMHDDDKREFKPTGCEDYDRVCRERAVLTRQLQEAKMALADVKTSWSGQIAALETQVARVSRQAGEEGAERRRIEVEKKELREKLDSMAAELEKTKQNLANSEAKVSRLSAEVQSQARELKTLRAADITTELENRIRALTEENRELLEALETERGLVKVLEANLEESNRICDESKEEVAHLNTAMSDMQHDYMDLQRKYEKERREKDEALLRNAHMSQNIEMSQCNVRYLENEVVDLKAKITEVESIVAQHRENQAECLIIKENEKARTKEIEDLQRRIEAFTESENSLKKSIQDLETEICDKNKKIKTLDNRISDMKKTLQRELQSSKTDLTSAEEQDISRRYLKHVVLRFLTSRELEARQLTRALSTLLRLSAHEEALLRAALPARTGLAAWFPSLNA-