Monarch geneset OGS2.0

DPOGS214191
TranscriptDPOGS214191-TA1326 bp
ProteinDPOGS214191-PA441 aa
Genomic positionDPSCF300014 + 22976-27615
RNAseq coverage63x (Rank: top 68%)
Annotation
HeliconiusHMEL0023055e-15461.43% 
BombyxBGIBMGA005921-TA3e-12876.12% 
DrosophilaCG9867-PA1e-13354.77% 
EBI UniRef50UniRef50_E2BFN91e-15857.85%Uncharacterized glycosyltransferase AER61 n=6 Tax=Formicidae RepID=E2BFN9_HARSA
NCBI RefSeqXP_001599568.11e-16359.43%PREDICTED: similar to ENSANGP00000012343 [Nasonia vitripennis]
NCBI nr blastpgi|3454833456e-16259.43%PREDICTED: uncharacterized glycosyltransferase AER61-like [Nasonia vitripennis]
NCBI nr blastxgi|3454833454e-16259.43%PREDICTED: uncharacterized glycosyltransferase AER61-like [Nasonia vitripennis]
Group
Gene OntologyGO:00167571.5e-30transferase activity, transferring glycosyl groups
KEGG pathway 
InterPro domain[147-394] IPR0076571.5e-30Glycosyltransferase AER61, uncharacterised
Orthology groupMCL12470 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214191-TA
ATGTATTTCTATTATAAATTTGGTAGTACTATGAAGTGTTTAATATTTTTACTTTTCCTATACATTGGCGATATTCTTTCGGCAACATTAAATGACTTAAATTTACCTCCAGAACACATTCCGTACCTTTTTAACCAATTTCCTGACTTGGCCAATGCTTGTAAGGAAAGTATAAATTGTCCATATAAATCTTTGACTAACAGTAAAGTTTGTTGGGGTTATGAAAACAACTGTCCTCCAAATAGCTCATATCATGTAAGACCCAAATGTCCCGGTGATCATCGAGGTTGGGTCAAAACTAAACAAGCTCAAATTGATACATTTTATACTCAGGCAGACTTTGGTTATGTCAAGGAACAAATTCAGGATTTAATGGTTATGTGTGAAGCAACATACCCTTATGATAGTTCCTTAGAATGCTCAAAGTACTTAAGATTCTGCAGAGGTCGTAACCTCCTTCTAAATTTCACCAGTTTAGTTGGTAGAGGTGATAATCTCAGATATAAGATGGATATTCTTGGTCCAGGAGAAATAGGTGGTTATTGTAAATTTCATTCTACCAGACTGAAAAACGAAGCAGAACATATGAGTGCCTTGCAGTCTTGGGCTCCAGAGTTTGTCAATTTTGTTAAGACTCCAGGAAGACCCATACCTGATGGAATGTGCGATATAACTATTGACAAGCCAACATATATAATGAAATTAGATGCCACTGTCAATATGTACCACCACTTCTGTGACTTCTTCAACTTGTACGCGTCACTCCATGTGAACTCCACACATCCTTCGACATTCAGCAGGGACAACCACATACTTGTTTGGGAGACGTTTACCTACGACTCCGCATTCAAAGATGCTTTCAAAGCATTCACATCAAACCCAATATGGGACTTGAAAGAGTTTAGAGGGAAAACTGTTTGTTTCAAGAATGCCGTGTTCCCTCTCCTGCCACGGATGATTTTTGGACTGTATTATAATACGCCGTTGATATACGGTTGTGAAACAAGCGGTTTGTTCCATTCATTCTCAAAGCACATACTCCATTCTTTAAACGTGAAGCTTCATTTGCGAACTGACGATAGAGTCCGCATCACATTACTGTCGAGAGGAACAACATATCGCACTATACTGAACGAACAGGAAATAGTTGAAGCTTTGCTTAAAGTGAAAGGTTATTACGTGCAGAGGGTAGTTTACGATAGAACTGTGCCATTCACTAAGCAGTTGGATATAACTCATAATACTGATGTGTTCATCGGCATGCACGGCGCAGGTCTGACGCACCTCTTGTTTCTACCGGATTGGGCAGCTCTGTTTGAAGTGTAA

Protein sequence:

>DPOGS214191-PA
MYFYYKFGSTMKCLIFLLFLYIGDILSATLNDLNLPPEHIPYLFNQFPDLANACKESINCPYKSLTNSKVCWGYENNCPPNSSYHVRPKCPGDHRGWVKTKQAQIDTFYTQADFGYVKEQIQDLMVMCEATYPYDSSLECSKYLRFCRGRNLLLNFTSLVGRGDNLRYKMDILGPGEIGGYCKFHSTRLKNEAEHMSALQSWAPEFVNFVKTPGRPIPDGMCDITIDKPTYIMKLDATVNMYHHFCDFFNLYASLHVNSTHPSTFSRDNHILVWETFTYDSAFKDAFKAFTSNPIWDLKEFRGKTVCFKNAVFPLLPRMIFGLYYNTPLIYGCETSGLFHSFSKHILHSLNVKLHLRTDDRVRITLLSRGTTYRTILNEQEIVEALLKVKGYYVQRVVYDRTVPFTKQLDITHNTDVFIGMHGAGLTHLLFLPDWAALFEV-