Monarch geneset OGS2.0

DPOGS214517
TranscriptDPOGS214517-TA1782 bp
ProteinDPOGS214517-PA593 aa
Genomic positionDPSCF300287 - 359351-369336
RNAseq coverage66x (Rank: top 67%)
Annotation
HeliconiusHMEL0178331e-17775.33% 
BombyxBGIBMGA010965-TA0.073.64% 
DrosophilaExt2-PB4e-14748.24% 
EBI UniRef50UniRef50_B0X0535e-15746.40%Exostosin-2 n=7 Tax=Coelomata RepID=B0X053_CULQU
NCBI RefSeqXP_976077.17e-16949.75%PREDICTED: similar to exostosin-2 isoform 2 [Tribolium castaneum]
NCBI nr blastpgi|910764241e-16749.75%PREDICTED: similar to exostosin-2 isoform 2 [Tribolium castaneum]
NCBI nr blastxgi|2420201167e-16850.79%Exostosin-2, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00167581.2e-45transferase activity, transferring hexosyl groups
GO:00312271.2e-45intrinsic to endoplasmic reticulum membrane
GO:00160205.1e-45membrane
KEGG pathwaytca:6592872e-168 
 K02367 (EXT2)maps-> Glycosaminoglycan biosynthesis - heparan sulfate
InterPro domain[449-588] IPR0153381.2e-45EXTL2, alpha-1,4-N-acetylhexosaminyltransferase
[101-374] IPR0042635.1e-45Exostosin-like
Orthology groupMCL15172 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214517-TA
ATGGCTGCGGAGACCCTTCTAAAAACGAGCAAGTATTCAAAGTATACGTATAGGCAAATTGTGTATCATAGATTTTTTGTCGGGCTTCTATTATTTATCTTGATATCATTAGTATTAACTGTAGTGTTTAATCTTTTCGCGGGAAGTGCGCCTCATGCTGATATTTATGAGTCTGTGAATTTGGAAGCATTGAGTCTACCAATTCGTAATATTGTGAGTAGATCGAGATCTGCCGATCCACGCTGGACGAACTGCACTTACTGGCATTGTTTTAATGTTTACAAATGTGGACGAAGAGGTCACAACAAGATAACCATATATATCTATCCCTTAACAGAATACAGGAATGAGAATGGGAAGGCTATTTCTCATTTTTCTAGAGAGTTTTATGAAATTTTAAGTACAATAAAACGTAGTAAATATTACACCCCGAATCCAGAAGATGCCTGCCTCTTGGTGCCTAGTATAGATACATTAAATCAGATAGGTTTTTCTAGTGAATATGTTTCCAAAGCTCTGCAGTCACTCGAGCATTGGAACAATGGAGAAAATCACCTTATATTTAATATGGTGGCGGGAGTGTCGCCGAATTATAATACAGTTATTGATCTCAACACCAGCAAAGCCATTATAGCGGGAGCTGGCTATGACACTTGGACATTTAGATACGGTTTTGATATATCCATACCTCTATACAGTTATATAGCACAAAGAATAAATAGCTCTCAGCCCAAACAGAAGAGTTTTATGATAATATCCTCACAAACAAACATACCTAGCGATTATCTAGCTCAGCTACAGAGTATAGCGTCTTCATCAAATGATTTACTGCTTCTAGATAGATGCAAAGATGCTAGCACCGATTATACTAAACGTTGTGAATATACAACGGGCAAAATGTTTGATTATCCGGATGTATTAAAGGAGGGTATGTTTTGTCTTGTGGTACGAAGTGCGAGGCTCGCTCAGCCAGTTTTGATGGATGTCATAGCATCACAGTGTATACCGATCATTATAGCAGATGCAATAATTATGCCATTTAATTCACATATAGATTGGAATAAAATAGCATTGTTTGTACCCGAAGAGAATATAAAGAATTTGGTACGGATAGTACATTCAGTGAGTAAGGAACGAAAAGGTGAAATGTATTGGCAGTTACGTTGGGTTTATGAGAGATATTTCTCTAGCATAGAGAAGCTTACATTGACTACTCTGGAGATAATCAACGAGAAAGTCTTTCCGTTGTCAGCTAGGATGTACGAAGATTGGAACGTTCCTGAACATTTGTACGGGCCAGTCAACCCGCTGTTCTTGCCGGTGACCGCGCCCAAGTCCCCGGGCTTCACCGCTGTCATCCTGACCTATGACCGGGTGGGCAGTCTGTTCACACTCGTCAGACAACTGGTCCGAACGCCCAGCCTCGCCAAGATACTCGTGATATGGAACAATCAGAAGAAACCTCCACCTCCCTCTTCCGAGTGGCCGGTCGTCAATAAGCCGCTGAAAGTTATAAGGACTAAGGAGAACAAACTGTCAAATAGATTCTTTCCGTACGATGAGATCGATACAGAATGCCAGTTGACGATTGACGATGATATTATCATGTTGACCCCTGATGAACTTGAATTCGGCTTCGATGTCTGGCGCGAGTTCCCCGACCGCATAGTGGGCTTCCCGTCCAGGCTTCACGTGTGGGATAACACTACACACACATGGAAATACCACAGCGAGTGGACCAATCAGATATCTATGGTGAGAACAGTAGAATTTTCTGACTGA

Protein sequence:

>DPOGS214517-PA
MAAETLLKTSKYSKYTYRQIVYHRFFVGLLLFILISLVLTVVFNLFAGSAPHADIYESVNLEALSLPIRNIVSRSRSADPRWTNCTYWHCFNVYKCGRRGHNKITIYIYPLTEYRNENGKAISHFSREFYEILSTIKRSKYYTPNPEDACLLVPSIDTLNQIGFSSEYVSKALQSLEHWNNGENHLIFNMVAGVSPNYNTVIDLNTSKAIIAGAGYDTWTFRYGFDISIPLYSYIAQRINSSQPKQKSFMIISSQTNIPSDYLAQLQSIASSSNDLLLLDRCKDASTDYTKRCEYTTGKMFDYPDVLKEGMFCLVVRSARLAQPVLMDVIASQCIPIIIADAIIMPFNSHIDWNKIALFVPEENIKNLVRIVHSVSKERKGEMYWQLRWVYERYFSSIEKLTLTTLEIINEKVFPLSARMYEDWNVPEHLYGPVNPLFLPVTAPKSPGFTAVILTYDRVGSLFTLVRQLVRTPSLAKILVIWNNQKKPPPPSSEWPVVNKPLKVIRTKENKLSNRFFPYDEIDTECQLTIDDDIIMLTPDELEFGFDVWREFPDRIVGFPSRLHVWDNTTHTWKYHSEWTNQISMVRTVEFSD-