Monarch geneset OGS2.0

DPOGS203381
Transcript: DPOGS203381-TA (1365 bp)
Protein: DPOGS203381-PA (454 aa)
Genomic position: DPSCF300003 + 479018-484930
RNAseq coverage: 33x (Rank: top 75%)
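The genomic position field packs scaffold, strand, and coordinates into one string. As a minimal sketch of how it could be split apart, the snippet below parses that string and computes the span. The `Locus` type and `parse_locus` helper are hypothetical names introduced here, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a 'scaffold strand start-end' position."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a position string such as 'DPSCF300003 + 479018-484930'."""
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return Locus(scaffold, strand, int(start), int(end))


locus = parse_locus("DPSCF300003 + 479018-484930")
print(locus.scaffold, locus.strand, locus.span)
```

Under that coordinate assumption the locus spans 5913 bp, which is longer than the 1365 bp transcript. That would be consistent with intronic sequence within the gene model, although the record does not list exon boundaries.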
Annotation
Heliconius       HMEL007286         5e-104  44.39%
Bombyx           BGIBMGA002045-TA   3e-78   38.67%
Drosophila       CG9384-PA          2e-80   34.73%
EBI UniRef50     UniRef50_D6WG02    3e-79   36.71%  Putative uncharacterized protein n=4 Tax=Tribolium castaneum RepID=D6WG02_TRICA
NCBI RefSeq      XP_313212.4        2e-84   37.94%  Anopheles gambiae str. PEST AGAP012440-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|158291693       3e-83   37.94%  Anopheles gambiae str. PEST AGAP012440-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|158291693       4e-81   37.56%  Anopheles gambiae str. PEST AGAP012440-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0016020         7.9e-84  membrane
                 GO:0016758         7.9e-84  transferase activity, transferring hexosyl groups
                 GO:0005975         7.9e-84  carbohydrate metabolic process
KEGG pathway     aga:AgaP_AGAP012440  4e-84
                 K00738 (MGAT4A_B) maps-> N-Glycan biosynthesis
InterPro domain  [44-410] IPR006759  7.9e-84  Glycosyl transferase, family 54
Orthology group  MCL10892  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS203381-TA
ATGAAATTGAACCGAGGAAACCTGTCATGTGATACAACGCCTGAGGTGAATCAAAATGGTACGAAATGGGACCTCGGGGCGAATCCTAAACATGCAAAGCCGGTCATGAATTCTTTCGCCTATCAACTAATGCCCCATTTGAATGACCATCCATCCAGCCTGAAGCCAGCATTCCTACTGAATGGCACTCGGGAGAATTGTGATGTGGTTATCGGAATTCCAACTGTTCACAGAGATAAAGGATCTTATATTATATCAACGGTTTCGAATGTTATCCATGGTATGACTGATGAGGAGAAGGATAAAAATCTTATCGTTGTTTTGGTCGCTGAAACGCAAATGGACTACGTCCGTCCATTAGTGGAAAAACTTGCAACACTGTTTTCGGAATACATTTATACTGGTCTTGTCGAATTGGTAGTACCATCTCCTTTCTATTATCCGGATTTTGATAATTTACCTTTGAACTTCGGGGATTCGAACAAGCGCGTCATGTGGCGTACAAAACACAATCTCAACGTCTTGTATGTCATGGCCTATGCACGCTCTAGGGGAACTTATTACCTGATGCTTGAAGACGATGTCACTGTTAAGGAGAGGTTCATCGAGCATATGTTGGATTTTGCAAAGGCCAAAACTAAATCAAATCCTGATTGGTATGTGCTTGAATTTTGCAACATCGGCGGTATTGGGAAGTTGTTTAAAACATCTGATCTAATTTACTTCATGACCTATATCCAACTGTTCTACAAACAAATGCCCATTGATTGGCTTCTGGAGAGTTACATTGCTGACAGCTCTTGCTCTCTTGACCGGGCCACGCCTGGTTATAATGAAGCCCGATCCTTATTCTTCCCGCACGACAACCCGGCAGTCCAACGAGTTTTTACCGATATAAAAATATTCAAGAATCATACGCCTATGAAAGCATACGACGGAAAAACTTTCTTCTGGGGAATAAACCCGGTGAAAGGAAACGTCGTGGAATTTTGGTTTAGGGAGCCCACTAATATTGTCAGATATACATTCCGAAGTGGCAATTTTGACCGTTACGGCGATATATTTGACAACGCCGTAGTCGAAGCTCTTCCGATTAGGAAACGAAAGTTTGTCATGGTGAAAAGGTTTGACGAATTAGGTTTTGCCAGTGGGGACCTAAATTTAGGAGCTTTGGTTGCTATTAGAATTCGAGTTACCAAAAACAGCACTCACTGGGTAATTCTATCTGAGCCTTTGGCTTATATTTTTTACGGTTATAACGCTTTGGACTGTTCGTTGGAATCATTGGAATCAACACAGACGTATCTACAAGATGTTAAGCGTCTGTCTCCGAAGAAGTACCTTCCACCGGACACGACAGCGTGA

Protein sequence:

>DPOGS203381-PA
MKLNRGNLSCDTTPEVNQNGTKWDLGANPKHAKPVMNSFAYQLMPHLNDHPSSLKPAFLLNGTRENCDVVIGIPTVHRDKGSYIISTVSNVIHGMTDEEKDKNLIVVLVAETQMDYVRPLVEKLATLFSEYIYTGLVELVVPSPFYYPDFDNLPLNFGDSNKRVMWRTKHNLNVLYVMAYARSRGTYYLMLEDDVTVKERFIEHMLDFAKAKTKSNPDWYVLEFCNIGGIGKLFKTSDLIYFMTYIQLFYKQMPIDWLLESYIADSSCSLDRATPGYNEARSLFFPHDNPAVQRVFTDIKIFKNHTPMKAYDGKTFFWGINPVKGNVVEFWFREPTNIVRYTFRSGNFDRYGDIFDNAVVEALPIRKRKFVMVKRFDELGFASGDLNLGALVAIRIRVTKNSTHWVILSEPLAYIFYGYNALDCSLESLESTQTYLQDVKRLSPKKYLPPDTTA-