Monarch geneset OGS2.0

DPOGS204435
TranscriptDPOGS204435-TA1212 bp
ProteinDPOGS204435-PA403 aa
Genomic positionDPSCF300002 - 230756-231967
RNAseq coverage22x (Rank: top 78%)
Annotation
HeliconiusHMEL0095582e-1924.47% 
BombyxBGIBMGA001608-TA1e-4930.77% 
DrosophilaMgat2-PA6e-3339.20% 
EBI UniRef50UniRef50_D7EJQ11e-5633.60%Putative uncharacterized protein n=4 Tax=Coelomata RepID=D7EJQ1_TRICA
NCBI RefSeqXP_969675.17e-6036.23%PREDICTED: similar to Mgat2 CG7921-PB [Tribolium castaneum]
NCBI nr blastpgi|910944331e-5836.23%PREDICTED: similar to Mgat2 CG7921-PB [Tribolium castaneum]
NCBI nr blastxgi|910944332e-6035.78%PREDICTED: similar to Mgat2 CG7921-PB [Tribolium castaneum]
Group
Gene OntologyGO:00084551.4e-77alpha-1,6-mannosylglycoprotein 2-beta-N-acetylglucosaminyltransferase activity
GO:00057951.4e-77Golgi stack
GO:00160211.4e-77integral to membrane
GO:00093121.4e-77oligosaccharide biosynthetic process
KEGG pathwaytca:6581732e-59 
 K00736 (MGAT2)maps-> N-Glycan biosynthesis
InterPro domain[65-392] IPR0077541.4e-77N-acetylglucosaminyltransferase II
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204435-TA
ATGCTTGACGGGTGTGTTATGACAAAGAAAATTTTTGTGTATCCGGTAGTAGCGTGTGCAATGTTGATATATGTTTATCTGAAAGTCACTTCGAGGCTGAGCAACAGTCCCATTATAATACCATATAACAATGAAAAAATACCTACTAAAATTCAAGCGAAGCTTGCAAAGAAGGACATCGACTATATACTGCATTTAAAAAACGTAATTAATGACGCCAATCGGAAACAGAGCACCGAAGATGATGAAATGGATTGGGATTTCAGCTTCATAATAGTGATACAAGTGTATAGAGGAATAAATAATCTCAACCATTTGGTGGATGGTCTCAAGCAAGCCAAAGGTATCGGTACCGCGTTGCTGTTATTCTCTCATAGCTTCTACAGCAAGGATATCAACCGAGTGGTGCGGAATATAAGCTTTGCGAGGGTCCTGCAAATATATTACCCATACAGCGTACAGTTACATCCTTTCGTGTTTCCCGGATATGATCAGAAGACATGCGACGAAGTCATGAGTTGTCGCAAATCAGATAGGAAGGCTTTGGTGATTCAACATAAGCACCACTGGTGGTGGTCGGCCAGTTTTGTCTTTGACCGAGTATTTGCTACAAGAAATTACAACGGTACCATACTTTTTCTTGAGGAAAATCAATATGTTACTTCGGATTTTATATACATTTTCCGAATTCTTGACGGATTGTTGAGGTCATCGAACCTACATTGTGATATTATAAGTCTGGGCAATGGTAAATCAAAATCATCCAGCTACCGACTCATTAACTCTTCAATTCTTCTAAAACCTTGGGATACAAGAGAAACACTCGGTATCGCCTTTAACAAAGATACTTGGAAGACAGTGAAAAATTTATCTGAGCACTTCTGTCGTTACAATGACAACCGATGGTCCAAATCTCTAATACATTTGTCAACTAAGACATCACTGGGGAAGTTTTACGCCTTATCAATAGAAGGTTCGAGAGTCTTCCGTTTAAATAAATGCAAGGACCACAAAACTTGCAATGAAACCAAAAATAATGAAATTCTCCTCAACTTCGTCCGCAAGATTCGAAAAAAACTGTTTCCCTTCAAAATGCATATCGTGAAAAGCAACAGCGTTGAAGAAGTAGGGGAGGGCAATGGCGGCTGGACCGATGTGAGGGATCATGAACTATGTCTTTTTATTAGTAATCTTAGAAATGTAACTCCCTGA

Protein sequence:

>DPOGS204435-PA
MLDGCVMTKKIFVYPVVACAMLIYVYLKVTSRLSNSPIIIPYNNEKIPTKIQAKLAKKDIDYILHLKNVINDANRKQSTEDDEMDWDFSFIIVIQVYRGINNLNHLVDGLKQAKGIGTALLLFSHSFYSKDINRVVRNISFARVLQIYYPYSVQLHPFVFPGYDQKTCDEVMSCRKSDRKALVIQHKHHWWWSASFVFDRVFATRNYNGTILFLEENQYVTSDFIYIFRILDGLLRSSNLHCDIISLGNGKSKSSSYRLINSSILLKPWDTRETLGIAFNKDTWKTVKNLSEHFCRYNDNRWSKSLIHLSTKTSLGKFYALSIEGSRVFRLNKCKDHKTCNETKNNEILLNFVRKIRKKLFPFKMHIVKSNSVEEVGEGNGGWTDVRDHELCLFISNLRNVTP-