Monarch geneset OGS2.0

DPOGS208641
TranscriptDPOGS208641-TA2691 bp
ProteinDPOGS208641-PA896 aa
Genomic positionDPSCF300281 - 236414-242096
RNAseq coverage167x (Rank: top 51%)
Annotation
HeliconiusHMEL0117460.080.72% 
BombyxBGIBMGA007776-TA0.076.37% 
Drosophilabotv-PA0.061.56% 
EBI UniRef50UniRef50_Q9XZ080.061.56%Exostosin-3 n=18 Tax=Coelomata RepID=EXT3_DROME
NCBI RefSeqXP_397082.10.060.26%PREDICTED: similar to brother of tout-velu CG15110-PA [Apis mellifera]
NCBI nr blastpgi|3838628540.060.04%PREDICTED: exostosin-3-like [Megachile rotundata]
NCBI nr blastxgi|3838628540.060.15%PREDICTED: exostosin-3-like [Megachile rotundata]
Group
Gene OntologyGO:00167582.5e-87transferase activity, transferring hexosyl groups
GO:00312272.5e-87intrinsic to endoplasmic reticulum membrane
GO:00160205.6e-46membrane
KEGG pathwaydpo:Dpse_GA134990.0 
 K02370 (EXTL3)maps-> Glycosaminoglycan biosynthesis - heparan sulfate
InterPro domain[639-881] IPR0153382.5e-87EXTL2, alpha-1,4-N-acetylhexosaminyltransferase
[190-476] IPR0042635.6e-46Exostosin-like
Orthology groupMCL11247 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208641-TA
ATGGTTACAGTGATTCTGTTTGTTGTGCCGCTCTTTACACATTATTATTTGTCTAAGTATGAATCATCATCAATGACATTGGGTTCCAATAACATGCGACACACACTAGAAGCCCTCGGAGACTTATCAGCCGTCAATATCGGAGACCTTAAGATAAGGATAGAAGAAATGCTTAGAATTAAGGCGTCAGTGTCCACGGAGTTGCGTGAATTAGAAGAACGACGAGGGAAACTGCAAAAAGAAGCGGCAGCTGCTAGTGCAAACGCAGACAGTGTTAAGGCTGAGTATGCACGCGCGACTGCTGAATTGCAGAGATTGAGGGTATCCGCAGACCAAGCTCGACTAGCCCAGTTGGAGGCTATACGACGGGATTCCCCTGAACTCGCCCCACCACTGCCAATCTTACCTTCGTCCCCACCACCCATTCTCCCACCTGCTACATCAACATCTGAACTACACTGTCGAATGCATTCATGCTTTGATCACTCCCGCTGTTCACTAACTTCCGGTTTCCCAGTGTACTTCTATGATCCTGATGTATTCTCTCCTCTCATTGGGGCGGAGGTGGATGGTTTTCTCAAAACCACATTACGACAAACGTTAAGCTACAATTCACACCTTACTCAAAACCCTAATGAAGCATGCGTCTATCTCGTGCTGGTCGGCGAAGGATTTCCTTCTGACAAGACTCAAACTTCCACGAAAAAGCTGTTGTTGAATGAGACAGCAATCAAAAGTCTGCCATATTGGGGCGGAGACGGGCGTAACCATGTGTTACTAAATCTGGCTCGTCGCGATCTATCCGTCGGTTCCGGAGACGCGTTTCTGGATTCGTCGACTGGTAGAGCGATGATAGCGCAGTCTACGTTTACATTGCAACAGTTCCGGCCAGGATTTGACCTGGTGACACCTCCAGCCCTCGGACCTCCTGGAGGAGACGTATGGTCAGACTGCGCGCCTATGGCACCGGCAAGACGTCTATACATACTTAGTTTTCAGGGTTCACAGACTCCAGCGGCAGGGTCCCACGTAGATGACGATCAGTCACTCATCGAGTCTCTGAGGAAGATGGTCAGCCAGGCTCCTTCTTCTGATGTGTTTCTATTGCAATTCGACTGCGACCCGCCTATCGACAAGCGTGCGGTCCTTCCGATCGGTGACTGGGGACTCTGCGGCACCGATCGGTCGAGACGAGCCGTTCTTAGAGATTCCACTTTCGTATTAATATTGGCACCGGCTGACGGAGATTATGCTTCAACAGCTCTCCTGCAAGCGAGGCTATATGAAGCGCTACGCTCCGGAGCTATACCCGTCATACTTGGGGGTGATCGTATACAGCTGCCGTATAGCGAAGTTTTAGACTGGCGAAGGGCTACATTATCCCTCCCGAAAGCTCGCGTCACTGAGTTACATTTTCTGCTGAGAGCTCTATCGGATGCAGATTTACTAGCGTTCCGTAGACAGGGACGTTTGTTATGGGAGAGATATTTAAGTTCGGTACAAGCTAGTATGGACTCGCTCCTGGCTACTATACGGACTCGTTTGAACATTCCTCCACATTCAGCGGCACCGACTATGGGTGTGCCGGCGTTCAATGACACCTTCTATCCACCGAAAATTGAACCGCCGGCCGTGGACACTGAGCCCGAAGAGACCCTCGGGCCTTTAGAAGCTCCTTATCCGAGTCCGGCCTATAGACGTAATTACTCGGTGTCTCTATTAAACGGTTACGAACTATGGAATGACTGGGGAGAGCCGTTCGCACTGTTTCCTCAATTGCCTTGGGATCCGCCGGTAACATCGGAAGCCCGGTTCATGGGTTCCGCAGCAGGTTTCCGACCAATCGGAGCAGGAGCCGGGGGTTCTGGGAAGGAGTTCAGCGAAGCTCTAGGAGGTGACCGGCCGAGGGAACAGTTCACTATTGTCATCCTCACGTATGAGAGGGAAGCCGTTCTGGCAGCGGCACTGGCGAGGCTCCGGGGTCTACCGTACTTGAATAAGGTGGTGGTTGTATGGAACGGAGTGAACCCACCACTCTCGTCCCAGTCGTGGCCGGAGTCGGGCGCGCCGGTGGCGGTGGTGCGGGCTCCTCGCAACTCATTGAACAACCGCTTCCTACCATACAACGTGATCGACACTGAGGCCGTTCTCTGCGTAGACGATGACGCGCATTTGAGACACGATGAGATAGTCTTCGCGTTTAGAGTCTGGCGTGAACATCGCGATCGTATAGTGGGCTTCCCTGGGAGGTACCACGCGTGGGATCTCAACTTCAATAATGGATTCCTTTACAACTCTAACTACAGTTGTGAGCTGAGTATGGTGTTAACCGGGGCGGCGTTCGTGCACCGCTACTATTTGTGGTCGTACTGGCGTCTGCTGCCCGCCGCTGTCCGGGACTACGTCGACCAGTACATGAACTGCGAGGACATCGCTATGAACTTCCTAGTGGCTCACATCACGAGGAAACCGCCGGTCAAGGTGACATCTCGTTGGACGTTCCGTTGTCCTGGTTGCCCTGTGACGCTGTCAGCGGACGAGACCCATTTCCACGAGCGACACAAATGCATTCAGTTCTTCTCCCAGGTGTTTGGTTACACTCCACTTCTGTCGACACAGTACAGAGCTGATTCCGTACTTTTTAAGACGAGGATATCACACGACAAGCAGAAGTGCTTTAAATTCATTTAA

Protein sequence:

>DPOGS208641-PA
MVTVILFVVPLFTHYYLSKYESSSMTLGSNNMRHTLEALGDLSAVNIGDLKIRIEEMLRIKASVSTELRELEERRGKLQKEAAAASANADSVKAEYARATAELQRLRVSADQARLAQLEAIRRDSPELAPPLPILPSSPPPILPPATSTSELHCRMHSCFDHSRCSLTSGFPVYFYDPDVFSPLIGAEVDGFLKTTLRQTLSYNSHLTQNPNEACVYLVLVGEGFPSDKTQTSTKKLLLNETAIKSLPYWGGDGRNHVLLNLARRDLSVGSGDAFLDSSTGRAMIAQSTFTLQQFRPGFDLVTPPALGPPGGDVWSDCAPMAPARRLYILSFQGSQTPAAGSHVDDDQSLIESLRKMVSQAPSSDVFLLQFDCDPPIDKRAVLPIGDWGLCGTDRSRRAVLRDSTFVLILAPADGDYASTALLQARLYEALRSGAIPVILGGDRIQLPYSEVLDWRRATLSLPKARVTELHFLLRALSDADLLAFRRQGRLLWERYLSSVQASMDSLLATIRTRLNIPPHSAAPTMGVPAFNDTFYPPKIEPPAVDTEPEETLGPLEAPYPSPAYRRNYSVSLLNGYELWNDWGEPFALFPQLPWDPPVTSEARFMGSAAGFRPIGAGAGGSGKEFSEALGGDRPREQFTIVILTYEREAVLAAALARLRGLPYLNKVVVVWNGVNPPLSSQSWPESGAPVAVVRAPRNSLNNRFLPYNVIDTEAVLCVDDDAHLRHDEIVFAFRVWREHRDRIVGFPGRYHAWDLNFNNGFLYNSNYSCELSMVLTGAAFVHRYYLWSYWRLLPAAVRDYVDQYMNCEDIAMNFLVAHITRKPPVKVTSRWTFRCPGCPVTLSADETHFHERHKCIQFFSQVFGYTPLLSTQYRADSVLFKTRISHDKQKCFKFI-