Monarch geneset OGS2.0

DPOGS203672
TranscriptDPOGS203672-TA2172 bp
ProteinDPOGS203672-PA723 aa
Genomic positionDPSCF300010 - 2292879-2296831
RNAseq coverage303x (Rank: top 37%)
Annotation
HeliconiusHMEL0133390.088.35% 
BombyxBGIBMGA003727-TA0.079.38% 
Drosophilattv-PA0.052.93% 
EBI UniRef50UniRef50_Q9V7300.052.93%Exostosin-1 n=17 Tax=Coelomata RepID=EXT1_DROME
NCBI RefSeqXP_001975618.10.053.57%GG20465 [Drosophila erecta]
NCBI nr blastpgi|1948830520.053.57%GG20465 [Drosophila erecta]
NCBI nr blastxgi|1948830520.053.57%GG20465 [Drosophila erecta]
Group
Gene OntologyGO:00167581.4e-68transferase activity, transferring hexosyl groups
GO:00312271.4e-68intrinsic to endoplasmic reticulum membrane
GO:00160203.9e-52membrane
KEGG pathwayder:Dere_GG204650.0 
 K02366 (EXT1)maps-> Glycosaminoglycan biosynthesis - heparan sulfate
InterPro domain[454-705] IPR0153381.4e-68EXTL2, alpha-1,4-N-acetylhexosaminyltransferase
[82-363] IPR0042633.9e-52Exostosin-like
Orthology groupMCL12437 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203672-TA
ATGCAAGCAAAAAAACGTTATTGTTTTTTGTTACTTTCGTGTTTATTTTTAATATATTGTTATTTTAATGGTTTCAATTTACTCGGTGAGAAAAAACAACGCGCTCGTCACGATCTGTTGCCGTCTTTTGCGACTTTAGATGAATTAACAGAATCTCCATCAGGCCAGAAAAGATTACCACGAGCAACGACAAAATCATGTCGAATGCAAACATGTTTTGATTTTTCAAATTGTGGAAGCGATCCTAAGGTATACGTTTATCCTACTGACGGTCCTGTAAGTGCTACGTATCGTAAAGTATTGTCTGTCGTTCGGGAGTCTAGATATGCCACACATGATCCTGCTGAGGCATGTTTGTTTATACCTGCTGTCGATACGTTGGACGCTGATCCATTATCTCCTGAACATATACCAGACGTAGCATCAAGATTGTCACGACTACCATATTGGAAAAATGGAAGAAACCATCTCCTATTTAATTTATATGCTGGCACATGGCCTGACTATGCAGAAGGAGCATTGGGTTTTGATCCAGGAGATGCTATATTAGCGAGAGCCAGTGCTTCAGAAACAATATTCCGTGATGGATTTGATATCTCATTACCTCTATTTCATAAAGAACACCCAGAAAGAGGTGGTGTTCCACCTTCAGCTACAGGAAACCCATTTCCAGCACCTCGTAAACATTTGTTAGCATTTAAAGGGAAAAGATATGTTCATGGCATTGGTAGTGAAACAAGAAATTCATTATGGCATCTACATGATGGAAATAATCTGATTTTAGTCACAACTTGTCGACATGGGAAGTCTTGGAAGGATTTAAGAGATGAAAGATGTGACGAAGATAATAGGGAGTATGACAAATTTGATTATGAACAGCTACTGGCAAACTCCACTTTTTGTCTCGTTGCCCGAGGGAGGCGTTTGGGTTCTTACCGTTTCTTGGAAGCTTTGGCTGCTGGCTGTGTCCCCGTGTTGTTAAGCAATGGGTGGAGATTACCTTTCGATGAACGGATAGATTGGCGTCGTGCTGTTATATGGGCAGATGAACGGCTGCTTTTGCAGGTACCAGAGTTGGTGAGGTCAGTTCCTCCTGAGCGTATTCTTGCTTTACGTCAACAAACACAGCTCTTATGGGAACAATATTTCTCTTCAATAGAGAAGATAGTTTTCACAACTATTGAGATACTATTAGAACGAATTATGACTCATAGATCGTCTCGTCAACGTGAGGCTTTGATTTGGAATACTTCTCCGGGAGCTCTTGGTACACTGGCTACATATGGGGACTCCCGTGCCCATTTGCCACTCGCTCCCTCCGTGCCCCTCACCGCTCCGCCACCTACTCCGGGAACGACATTTACAGCACTATTGTACGTACAGGCAACATCCCCATCATTACATAAACTTCTCGTCAGTATTGCTAACAGCCAGTACTGTGAAAAGGTGGTTCTAGTTTGGGACAGCGAACGCGCTGCACCTTCATTGACATCTCTATCAAGAACAGCGGGAGATTCACGCCATCCGCTACCAGTAGTAGTCATCGATGCGACTACACACTATCCCGGTGAAGGTGTGTCTGCTCGTTGGCAGCCTCTGTGGGCGGTTCCCACAGCCGCTGTGTTTTCTTTGGACGGTGACGCACCGCTTTTAGCGGAGGAATTGGACTTCGCATTTCTCGTGTGGCAACACTTTCCAGAACGTATTGTTGGATATCCAGCAAGGAATCACTTTTGGGATGAAGCTAAGGGGTCATGGGGTTATAGCAGCCGATGGGGCGGGTCGTACTCGATGGTGCTAGCGGGGGCTGCGGTGACACATAGGTCACTGTTAGCGCAGTACGCCGCCCTGTCCCCGGCGGTCCGGCTCGCTGTCCGCCGGGCTGGGAACTGTGAGGACATCCTACTGAACTGTCTCGCGTCTCACCTCTCGCGACGACCGCCCATCAAGCTGGCACAACGACGCCGGTACAAGAGTCCCCACCACCGGTACAGGTCATCTTGGAGCGACCCTGAGCACTTCGTCCAGCGTCAGTCGTGTCTGAACACGTTCGCGGCTGCGTGGGGCTACATGCCATTAGTGCGCTCTGTACTGCGACTGGACCCCATACTATTCAAGGATCCCGTCTCCACATTGAGGAAGAAATATAGGAAAATGGAACTTGTAACCTAG

Protein sequence:

>DPOGS203672-PA
MQAKKRYCFLLLSCLFLIYCYFNGFNLLGEKKQRARHDLLPSFATLDELTESPSGQKRLPRATTKSCRMQTCFDFSNCGSDPKVYVYPTDGPVSATYRKVLSVVRESRYATHDPAEACLFIPAVDTLDADPLSPEHIPDVASRLSRLPYWKNGRNHLLFNLYAGTWPDYAEGALGFDPGDAILARASASETIFRDGFDISLPLFHKEHPERGGVPPSATGNPFPAPRKHLLAFKGKRYVHGIGSETRNSLWHLHDGNNLILVTTCRHGKSWKDLRDERCDEDNREYDKFDYEQLLANSTFCLVARGRRLGSYRFLEALAAGCVPVLLSNGWRLPFDERIDWRRAVIWADERLLLQVPELVRSVPPERILALRQQTQLLWEQYFSSIEKIVFTTIEILLERIMTHRSSRQREALIWNTSPGALGTLATYGDSRAHLPLAPSVPLTAPPPTPGTTFTALLYVQATSPSLHKLLVSIANSQYCEKVVLVWDSERAAPSLTSLSRTAGDSRHPLPVVVIDATTHYPGEGVSARWQPLWAVPTAAVFSLDGDAPLLAEELDFAFLVWQHFPERIVGYPARNHFWDEAKGSWGYSSRWGGSYSMVLAGAAVTHRSLLAQYAALSPAVRLAVRRAGNCEDILLNCLASHLSRRPPIKLAQRRRYKSPHHRYRSSWSDPEHFVQRQSCLNTFAAAWGYMPLVRSVLRLDPILFKDPVSTLRKKYRKMELVT-