Monarch geneset OGS2.0

DPOGS211302
Transcript        DPOGS211302-TA  1062 bp
Protein           DPOGS211302-PA  353 aa
Genomic position  DPSCF300125 - 245506-246782
RNAseq coverage   49x (Rank: top 70%)
Annotation
Heliconius      HMEL005881        5e-28  30.95%
Bombyx          BGIBMGA004959-TA  1e-57  72.41%
Drosophila      CG3038-PA         4e-65  41.83%
EBI UniRef50    UniRef50_Q95RP8   5e-63  41.83%  CG3038, isoform A n=18 Tax=Drosophila RepID=Q95RP8_DROME
NCBI RefSeq     XP_002071707.1    3e-67  42.32%  GK10121 [Drosophila willistoni]
NCBI nr blastp  gi|195448549      6e-66  42.32%  GK10121 [Drosophila willistoni]
NCBI nr blastx  gi|157112668      6e-64  44.06%  beta 1,3-galactosyltransferase [Aedes aegypti]
Group
Gene Ontology    GO:0016020  6.4e-73  membrane
                 GO:0006486  6.4e-73  protein glycosylation
                 GO:0008378  6.4e-73  galactosyltransferase activity
KEGG pathway     dpo:Dpse_GA10945  1e-26
                 K09664 (B3GNT7) maps-> Glycosaminoglycan biosynthesis - keratan sulfate
InterPro domain  [53-312]  IPR002659  6.4e-73  Glycosyl transferase, family 31
Orthology group  MCL16446  Insect specific

Nucleotide sequence:

>DPOGS211302-TA
ATGACAAAGAAAGTTGTAGCGTTTATACTTATAGCACTGCTATCGTTAGTAGTATGGGTCTATAGAAGTTCTAATTTCGAATACAATAACACCATAGTGTTAATAAAACAAAATACGCTCTTCAAACATGCTTTGATAAATCCTCTGAATACAACTTGCGATCGACATCCCTTGTTTGTGATAATAGTAACTTCATATGTTGGTCATGTAGAGTTGAGGAGTGCTCATAGACGAGCCATGCCATTAGAATACTTAGCTGCCTATAATGCTACAAGAGTTTTTCTATTAGCAAAAATACCTGGAAATGAGAGATATATTACCCAAGAAGCAATCAATGATGAGAGCAATACATTCGGTGACATTTTACAAGGACAGTTCTATGAAAATTACAGAAACCTTACATACAAACATCTCATGGGATTGCAATGGGCATCTACAACATGCAGTACAGCAACCTTCATTTTAAAAGTCGATGATGACACAGTATTTAATTTTGATCGAACATATGAATACATTAAAACATTATCTACAAATAAAAGCAACTCCTTAATAGGATACATTTTAAACAACACTCAGCCAAGAAGAAACACTGAAAACAAATGGTTTGTAACCTATGAGGAGTATCCAAGAAGTGTCTACCCTCAGTATCTGTCAGGATGGTATTACATAATAACTCCAGACGCTGCAAGAATAATAAGTCAAGAAGCTACTTACCATCCTTACTTCTGGATAGATGACATTTTTGTCACGGGGCTTCTCACTGAAAGCCTCGGTCTTAAATTAAAACAACTACCATTAAACTACTGGTTAGAATACTATGAACTACTCGAATGCTGCCTTCGTGATATGATAAAGAAATCTATTGTTTGTGACTACACTGTTGGTCCTAATGGCAGCAGAAACAATTTGATTGTAGAATTTAATGAGGCATATAGAAATTGCCACAAATGGGGTAATTGTACCCGACCAAATGATTACAATTTGAAGAAAGAATGTGTTGTAAGTAGGGAAAGGACTATTTTTAGTGATGGGAAAGCTATAATCGACCATGAAATACTCTGA

Protein sequence:

>DPOGS211302-PA
MTKKVVAFILIALLSLVVWVYRSSNFEYNNTIVLIKQNTLFKHALINPLNTTCDRHPLFVIIVTSYVGHVELRSAHRRAMPLEYLAAYNATRVFLLAKIPGNERYITQEAINDESNTFGDILQGQFYENYRNLTYKHLMGLQWASTTCSTATFILKVDDDTVFNFDRTYEYIKTLSTNKSNSLIGYILNNTQPRRNTENKWFVTYEEYPRSVYPQYLSGWYYIITPDAARIISQEATYHPYFWIDDIFVTGLLTESLGLKLKQLPLNYWLEYYELLECCLRDMIKKSIVCDYTVGPNGSRNNLIVEFNEAYRNCHKWGNCTRPNDYNLKKECVVSRERTIFSDGKAIIDHEIL-