Monarch geneset OGS2.0

DPOGS204299
TranscriptDPOGS204299-TA1278 bp
ProteinDPOGS204299-PA425 aa
Genomic positionDPSCF300046 + 459921-461198
RNAseq coverage153x (Rank: top 53%)
Annotation
HeliconiusHMEL0151750.079.30% 
BombyxBGIBMGA007575-TA4e-1694.74% 
Drosophilal(2)not-PA1e-12658.06% 
EBI UniRef50UniRef50_Q273332e-12458.06%Lethal(2)neighbour of tid protein n=19 Tax=Diptera RepID=NT56_DROME
NCBI RefSeqXP_565033.35e-14259.21%AGAP007168-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582861451e-14059.21%AGAP007168-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571292863e-14859.91%hypothetical protein AaeL_AAEL002483 [Aedes aegypti]
Group
Gene OntologyGO:00057835.4e-170endoplasmic reticulum
GO:00167585.4e-170transferase activity, transferring hexosyl groups
GO:00160215.4e-170integral to membrane
KEGG pathwayaga:AgaP_AGAP0071682e-141 
 K03845 (ALG3)maps-> N-Glycan biosynthesis
InterPro domain[1-411] IPR0078735.4e-170Glycosyltransferase, ALG3
Orthology groupMCL13741 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204299-TA
ATGCAAGAATGTGAAGGATTTCTAAATGGAACCCTCGATTACAGTAAACTTCGTGGTGACACTGGCCCATTGGTGTACCCTGCAGGTTTCGTCTACATCTATTCTATATTCTATTTTCTCACTAACCATGGTACCAATATAAAACTTGCTCAATATATATTTATTTTCATCTATCTTTTGTTATTAACTTTAGTCTTAAGAATTTATAAGAAAACTCGGAAAGTGCCTCCATATGTATTAGTAATAACAATTTTAACTTCATATAGAATACATTCAATCCATGTTTTACGTATGTTTAATGACCCTGTTGCTGTTCTGTTTCTGTATGCATCCCTTAATTTCTTTCTAGATTCGAAATGGTATTTAGGTAGTCTCTTTTACAGTTTAGCTGTTTCTATAAAGATGAATATACTTCTTTATGCCCCAGCACTATTTTTCTTTTATCTAGTAAACCTAGGGCTAAAAGGAACAATACAGCAGTTACTACTGTGTGGAGTAACACAATTAGTATTAGGGATGCCATTTTTACTAGTAGCTCCTATAGCATACATTAAGGGTAGTTTTGACCTGGGTAGAGTATTTAACCACACTTGGACTGTTAACTATAGATTTTTAGACATAAAAACATTTGAGAGTAAATTCTTTCATTTAACATTGTTAGGTATTCACATGATGCTTTTAATTTTATGTTTACCAATGTGTATAAAGTATTTTCAAAGTTATTGCCGGTTGAAGTATGTACAAAGACAGGTACAGCCCCAAATTGATGCTAAAAACAGAGAAAATAAGAAAAGAGCAAAGTTGAGAAAAGACATCAAAAGTAATTTGAATCAACCTGACGAAATTCTTTCTAAAGAACAAGAAGCATTCCTTAATTCTTTTGAAGCTATGCTAAAAAATTCATCACAAAAAAGCAAACAAGACAAAGTGATCAAAGAACATGAAAAGGAAAAACATTTTAGTATTAATTTTGATATCTTATCTCAACTGTTCATTCTTCCAATGTTTTTAGTTAATTTTATAGGTATTGTATGTGCTAGAAGTCTGCATTATCAATTCTATTCATGGTATTTCCATACTTTACCTTATTTATTATGGTGTACTAATTATTCTGTTATTGTAAGATTCTTAATATTAGCATTAATAGAGCTATGTTGGAATACTTATCCTAGTACAGATATCACAAGTGCACTTCTACATGTATGTCATATCAGTATATTGTATGGAGTTTATAAGAAAATGGCTATAGAATTAAACATTACGTCAAAACTAACATAG

Protein sequence:

>DPOGS204299-PA
MQECEGFLNGTLDYSKLRGDTGPLVYPAGFVYIYSIFYFLTNHGTNIKLAQYIFIFIYLLLLTLVLRIYKKTRKVPPYVLVITILTSYRIHSIHVLRMFNDPVAVLFLYASLNFFLDSKWYLGSLFYSLAVSIKMNILLYAPALFFFYLVNLGLKGTIQQLLLCGVTQLVLGMPFLLVAPIAYIKGSFDLGRVFNHTWTVNYRFLDIKTFESKFFHLTLLGIHMMLLILCLPMCIKYFQSYCRLKYVQRQVQPQIDAKNRENKKRAKLRKDIKSNLNQPDEILSKEQEAFLNSFEAMLKNSSQKSKQDKVIKEHEKEKHFSINFDILSQLFILPMFLVNFIGIVCARSLHYQFYSWYFHTLPYLLWCTNYSVIVRFLILALIELCWNTYPSTDITSALLHVCHISILYGVYKKMAIELNITSKLT-