Monarch geneset OGS2.0

DPOGS206689
Transcript        DPOGS206689-TA    939 bp
Protein           DPOGS206689-PA    312 aa
Genomic position  DPSCF300048 + 1270392-1271582
RNAseq coverage   107x (Rank: top 60%)
Annotation
Heliconius       HMEL008840        1e-70  66.67%
Bombyx           BGIBMGA008522-TA  1e-68  62.50%
Drosophila       beta4GalNAcTB-PA  5e-67  43.28%
EBI UniRef50     UniRef50_E3XGC3   2e-65  44.05%  Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XGC3_ANODA
NCBI RefSeq      XP_002054796.1    1e-75  47.13%  GJ22594 [Drosophila virilis]
NCBI nr blastp   gi|195392298      2e-74  47.13%  GJ22594 [Drosophila virilis]
NCBI nr blastx   gi|195392298      4e-71  47.73%  GJ22594 [Drosophila virilis]
Group
Gene Ontology    GO:0016757           1.1e-85  transferase activity, transferring glycosyl groups
                 GO:0005975           1.1e-85  carbohydrate metabolic process
KEGG pathway     cin:100175041        7e-46
                 K07966 (B4GALT1)     maps-> Galactose metabolism
                                               Glycosphingolipid biosynthesis - lacto and neolacto series
                                               Glycosaminoglycan biosynthesis - keratan sulfate
                                               N-Glycan biosynthesis
InterPro domain  [85-312] IPR003859   1.1e-85  Galactosyltransferase, metazoa
Orthology group  MCL25498             Lepidoptera specific

Nucleotide sequence:

>DPOGS206689-TA
ATGCGTTACTATGGAATTAAGAAGAAAATTTTGTACTGTGGATTAATTGTGCTCTTGTTTTTCATACTTCTTCATCCCTCCAGAAATGCAAAAGGAACCTATGAGTTTATTGAAAGGGAGCATATTTTGAGCAATTTGCTCTATGACACTGCTGAAAACTTTTCTAGTACTGCAATTGTTGAGTGTGATTACTACAATGTTGTTTATGATGATTCCACTCTTTCTATCAGTATTGCTGACGGTGATTTAGTTGAGAATCATAGGGTTAAAGATGGAGGGGAATACGTACCTGTCGAGTGCAGACCAAGTCTTAGCACTGCTATCATTATTCCATATAGGGACAGAGCTGAACAATTACGCGCTTTCTTGGTCTATATGCACATGTTTCTAAGAAGACAGTTTATCCACTACAGGATCTATGTTGTGGAACAAGTTGATAGTAAGCCATTTAACACGGCTAAGCTAATGAATATCGGGGCTGCAGCGGCAATACGCGCAGGTTTTCCTTGCCTGGTATTGCATAATGTGGATCTACTCCCACTTAGGCCAGCTAATTTGTATGCCTGTACAAAACTTCCTCGACATCTATCATCTAGTATCAATAAATTGAGATTTGTCCTTCCACACCAAAATGTATTTAGTGGTGTTGTGTCAATATCTTCGAAACAATTTAAACTTATAAATGGAATGACAAATGGAAACACTGGCGATAAAAGTGATCTTCACAATCGTCTTAAAGTTGCAGGTATAAAAATTACTCGTTATGAACCTTCATTGAGTCGTTATTATATGTCTTCACAGAAATTACAACGGAAAGTGATAAGATTCAATCGAGATATGAAACAAGAAATGAAAAAAGATGGTCTGAATTCATTGATGTACACAGAAGTTGCTACCGTCCTACATCCATTGTTTACTCATATCATGGTTGATTTGTAA

Protein sequence:

>DPOGS206689-PA
MRYYGIKKKILYCGLIVLLFFILLHPSRNAKGTYEFIEREHILSNLLYDTAENFSSTAIVECDYYNVVYDDSTLSISIADGDLVENHRVKDGGEYVPVECRPSLSTAIIIPYRDRAEQLRAFLVYMHMFLRRQFIHYRIYVVEQVDSKPFNTAKLMNIGAAAAIRAGFPCLVLHNVDLLPLRPANLYACTKLPRHLSSSINKLRFVLPHQNVFSGVVSISSKQFKLINGMTNGNTGDKSDLHNRLKVAGIKITRYEPSLSRYYMSSQKLQRKVIRFNRDMKQEMKKDGLNSLMYTEVATVLHPLFTHIMVDL-