Monarch geneset OGS2.0

DPOGS208558
TranscriptDPOGS208558-TA1188 bp
ProteinDPOGS208558-PA395 aa
Genomic positionDPSCF300064 + 1139802-1141501
RNAseq coverage265x (Rank: top 40%)
Annotation
HeliconiusHMEL0021900.082.53% 
BombyxBGIBMGA010664-TA1e-14280.07% 
DrosophilaCG5287-PA2e-13259.59% 
EBI UniRef50UniRef50_Q7Q4061e-12962.18%AGAP008131-PA n=7 Tax=Coelomata RepID=Q7Q406_ANOGA
NCBI RefSeqXP_968050.14e-14965.27%PREDICTED: similar to CG5287 CG5287-PA [Tribolium castaneum]
NCBI nr blastpgi|910804198e-14865.27%PREDICTED: similar to CG5287 CG5287-PA [Tribolium castaneum]
NCBI nr blastxgi|910804191e-14863.54%PREDICTED: similar to CG5287 CG5287-PA [Tribolium castaneum]
Group
Gene OntologyGO:00160211.5e-33integral to membrane
GO:00089631.5e-33phospho-N-acetylmuramoyl-pentapeptide-transferase activity
KEGG pathwaytca:6564241e-148 
 K01001 (ALG7)maps-> N-Glycan biosynthesis
InterPro domain[90-260] IPR0007151.5e-33Glycosyl transferase, family 4
Orthology groupMCL13975 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208558-TA
ATGTGGTCAATAATAATTTTAATAATATACTGTGTTATTGCTTATTTAATCACTGACGAATTAATACCAAAGTTAAAGCATTTATTTATTAATGCTGGTTTATATGGCATAGATTTATGCAAAGTCTCACAAGAGAAAATTCCAGAAGCCCTTGGTGTTGTGTCGGGATGTATATTTTTGGTTACAATATTCTTATTTATACCAATAGCTTTTGGAAATGATTTGATGGATAGGGGAAGCTTTCCCCATAATGAGTTTGCGGAACTACTAGCAGCTTTACTCTCTATATGTTGTATGTTGCTACTGGGCTTTGCTGATGATGTATTGAACCTCAAGTGGAGATACAAACTTCTCCTGCCAACAGTCGCATCACTCCCATTGTTGGTTGTGTATTATGTTAACTTCAACTCAACAACTTTTGTTGTGCCACTCCCATTGAGGCATTTTTTTGGAGTTTCTGTGAATATCGGTTTTCTGTATTATATATATATGGGAATGCTGGCGGTTTTCTGCACAAATGCAATTAATATTTTAGCTGGAATAAACGGTCTTGAAGTAGGCCAGTCACTAGTTATAGCTTTGTCCATAATAATTTTCAATTTACTTGAGCTAAAAGGAGATCAATTCAAAGCTCACTACTTTTCATTGCATATTATGATACCTTATCTTTCTACTACATTGGCTTTATTCAAGCATAATTGGTACCCTTCAAGAGTATTTGTTGGTGATACCTTCTGTTATGTGTCAGGGATGACATTTGCCGTAGTAGGCATACTTAGCCACTTTAGTAAGACTGTCCTTTTGTTCTTCCTACCCCAAATTATTAATTTTTTGTACTCAGTACCACAACTATTTCATATTATCCCCTGCCCAAGACACAGACTACCTAAGTATAGCGCAGAAACAAATTTGCTCCAAGCAAGCAGGACAGTTATTCCAAAAAAAGATCAGAAATATCTTAGCAAAAAAATTGTAGTGGTACTATCATTTTTTCGTTTAATTGATAAACTTGAAGATGATGCATCTATAGTAATGAATAATATGACACTGATCAATCTGTTTTTGATTAAGTTTGGGCCAATGTCTGAGGTTAGATTGACTGTATTGTTACTAATGTTTCAAGTGTTATGTACATGTGTAGCATTCATTATAAGGTATCCATTGGCTTCATATTTTTATGATATTTAA

Protein sequence:

>DPOGS208558-PA
MWSIIILIIYCVIAYLITDELIPKLKHLFINAGLYGIDLCKVSQEKIPEALGVVSGCIFLVTIFLFIPIAFGNDLMDRGSFPHNEFAELLAALLSICCMLLLGFADDVLNLKWRYKLLLPTVASLPLLVVYYVNFNSTTFVVPLPLRHFFGVSVNIGFLYYIYMGMLAVFCTNAINILAGINGLEVGQSLVIALSIIIFNLLELKGDQFKAHYFSLHIMIPYLSTTLALFKHNWYPSRVFVGDTFCYVSGMTFAVVGILSHFSKTVLLFFLPQIINFLYSVPQLFHIIPCPRHRLPKYSAETNLLQASRTVIPKKDQKYLSKKIVVVLSFFRLIDKLEDDASIVMNNMTLINLFLIKFGPMSEVRLTVLLLMFQVLCTCVAFIIRYPLASYFYDI-