Monarch geneset OGS2.0

DPOGS203382
TranscriptDPOGS203382-TA1362 bp
ProteinDPOGS203382-PA453 aa
Genomic positionDPSCF300003 + 501408-504684
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0072868e-8737.56% 
BombyxBGIBMGA002045-TA2e-8241.43% 
DrosophilaCG9384-PA7e-8036.57% 
EBI UniRef50UniRef50_Q9VUH41e-7736.57%CG9384 n=19 Tax=Diptera RepID=Q9VUH4_DROME
NCBI RefSeqXP_002047901.14e-8137.96%GJ13695 [Drosophila virilis]
NCBI nr blastpgi|1953782588e-8037.96%GJ13695 [Drosophila virilis]
NCBI nr blastxgi|1953782586e-7937.96%GJ13695 [Drosophila virilis]
Group
Gene OntologyGO:00160204.9e-78membrane
GO:00167584.9e-78transferase activity, transferring hexosyl groups
GO:00059754.9e-78carbohydrate metabolic process
KEGG pathwaydvi:Dvir_GJ136951e-80 
 K00738 (MGAT4A_B)maps-> N-Glycan biosynthesis
InterPro domain[10-297] IPR0067594.9e-78Glycosyl transferase, family 54
Orthology groupMCL10892 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203382-TA
ATGGTAGTGCAAAAAAAATGTGTAGGGTGGGGACCGGAAGAGGAAACTAAAAGCGAGAAATCGGAAATGAATTCTTTTGCTATTGAACTAATGCCTCATTTGATTCAACGTTCGTCCAGCCTGAAGCCGGTTTTCTTACTGAATGGCTCTCGACAGGACTGCGAGCTGGTTATAGGTATCACAACTACACACTCGGATAAGGAACTCTATATTCTGGTTTCACTCATGAATTTGATTGACGCCATGAATGAAAAGGAAAAGGGCAAAACTCTTATAATAGTTTTAGTAGCTGAATTGATTTTGGATCAGGTGCGCCCGTTATTGGAGCTACTTGCCTTTGTGTTCGCTGAACACGTCCATAGTGGTCTTGTCGAAATAGTGGTACCCTCTCCGCATTATTACCCCGATTTGGAAACGTTAACAACTAATCCCCTTGACCCATCCAATCGAGTGAAGCGCCGTACGAAGCAAAATCTTGACATATTGTATCTTATGGCCTATGCTCGCCCCAGAGGGACTTATTATTTAAGTCTTCGAGACGATATAACCGTTAAGTACCGATTCGTCGAGCATATAATGGATTTTATAAAGACCACCTCCGACACGAATCCTCACTGGTATGTTCTGGAATTCTGTAACGTCCGAGGTGTTGGGAAAGTGTTTAGAACTAAAAGTATGGTCCAATTCATGACATACATTCAAATCTTCTACAAAAATATGCCGATTGATTGGCTTCTGAACAGCTATATCGCTAATAGCTCTTGCTCTCGAAACAAGACAACGGAAACATGTAAGAAGAATAAATTGAAAAGCAAACCTAAATATCCAGTATCACTGTTTAACCACATTGGATTGTATTCTACTAGCGAGGGAAAGGTTCAGATATTGAAGCATTTAAATACTGACGAGGAAACTCTATTCACTGCCCACGACAACCCGCCGGTTGATCGAGTTTATACAGACATACCGGCATACGATAAGCATACGCTTTTAAGAGCTTATGAGGGAGAAACGTTCTTCTGGGGAAAGAAGCCGGTGGAAGGAAACGTTGTGGAATTTTGGTTTAGGGAGCCCACTATTATTGTCAGCTATGCCTTCGGAACCGGCAATATTTTGCATGAAAAGGACAAATTTTATCACGCCGTGGTGGAAGTTCTTCCGTATAAGAGACATCAGTTCATCTATGACAAAGATTTCGGAGAATTTGGCTATGTTTATGGGGATCTGCATTTCGGAGAGTTGGTTGCTATTAGAATTCGAGTCACCAAGAACAGCACCCACATGATTGTTCTATCTGAGATACAACTGGTGACTATTGCGCAAGCTAAATCTAGGCAACGCAAAATAGTTATAAATTATTAA

Protein sequence:

>DPOGS203382-PA
MVVQKKCVGWGPEEETKSEKSEMNSFAIELMPHLIQRSSSLKPVFLLNGSRQDCELVIGITTTHSDKELYILVSLMNLIDAMNEKEKGKTLIIVLVAELILDQVRPLLELLAFVFAEHVHSGLVEIVVPSPHYYPDLETLTTNPLDPSNRVKRRTKQNLDILYLMAYARPRGTYYLSLRDDITVKYRFVEHIMDFIKTTSDTNPHWYVLEFCNVRGVGKVFRTKSMVQFMTYIQIFYKNMPIDWLLNSYIANSSCSRNKTTETCKKNKLKSKPKYPVSLFNHIGLYSTSEGKVQILKHLNTDEETLFTAHDNPPVDRVYTDIPAYDKHTLLRAYEGETFFWGKKPVEGNVVEFWFREPTIIVSYAFGTGNILHEKDKFYHAVVEVLPYKRHQFIYDKDFGEFGYVYGDLHFGELVAIRIRVTKNSTHMIVLSEIQLVTIAQAKSRQRKIVINY-