Monarch geneset OGS2.0

DPOGS200524
Transcript: DPOGS200524-TA, 1794 bp
Protein: DPOGS200524-PA, 597 aa
Genomic position: DPSCF300119 - 436366-441665
RNAseq coverage: 572x (Rank: top 22%)
Annotation
Heliconius: HMEL005487, 7e-59, 48.12%
Bombyx: BGIBMGA009342-TA, 6e-120, 58.21%
Drosophila: CG12006-PA, 9e-93, 42.79%
EBI UniRef50: UniRef50_Q9VZM5, 1e-90, 42.79%, GPI mannosyltransferase 3 n=14 Tax=Diptera RepID=PIGB_DROME
NCBI RefSeq: XP_002007203.1, 7e-93, 45.16%, GI12509 [Drosophila mojavensis]
NCBI nr blastp: gi|195125475, 1e-91, 45.16%, GI12509 [Drosophila mojavensis]
NCBI nr blastx: gi|195125475, 1e-95, 45.16%, GI12509 [Drosophila mojavensis]
Group
Gene Ontology: GO:0006506, 5.6e-78, GPI anchor biosynthetic process
 GO:0016757, 5.6e-78, transferase activity, transferring glycosyl groups
 GO:0031227, 5.6e-78, intrinsic to endoplasmic reticulum membrane
KEGG pathway: dmo:Dmoj_GI12509, 2e-92
 K05286 (PIGB) maps-> Glycosylphosphatidylinositol (GPI)-anchor biosynthesis
InterPro domain: [16-511] IPR005599, 5.6e-78, GPI mannosyltransferase
Orthology group: MCL14867, Single-copy universal gene

Nucleotide sequence:

>DPOGS200524-TA
ATGGCTTTGGCAAAGGGCCTGAGGCCCGTGCAGGTTGGGGCAGTAATATTGGCGGTGCGTATTTTATCCGTGTTCCTAGTGCAAACCTGGTATGTGCCGGATGAGTATTGGCAAACCTTAGAAGTGGCACACAAATATGCCTTCGGTTATGGAGCCCTGACTTGGGAGTGGCAAAAGGGGATACGAAGCTATCTATACCCCAGTGTAGTCGCTGTGCTTTACTCCGTGTTGAAATTCACTGGCCTTGATTATCCAAATGTTGTGATAATTCTACCTCGTATTCTGCAAGCAATCATTAGTTCTATAGCTGACTACAAATTCTACAAATGGACTGGAAATCGTAAATGGGCATTGTTTTTGATACTTACATCCTGGTTTTGGTTCTATACTGCCAGTCGTACACTGTTACAGACCTTGGAAACAGCATTTGTGGCCATAGCCCTATCTGTATTTCCATTCAAAACTGGCAAGCTTGGATATTATGAGAAAGAAAGTTCAACATGGTTATGGCTGGCGTGTGTGTCCGTCTTTGTTCGGCCGACCTCAGCTCCATTATGGATAGTGCTTGGAATTTACAATATGGTTACAACCAACCAAGGAAGGATTGAGTTGTTATTGAAGACCTATTTACCAATCGCTTTTATATGTGGTGTTATGTTGGTCGGACTTGATTCCTACCTGTATGGCCGCCTTATTGTAACTCCGTGGGAGTTCTTCAAGTATAACGTCCTCGGCGGTGTAGCTTCATTCTACGGAGAGCATCCATGTTTCATACCTCACAAGGAGTTTCGGTTTGTGTTGCCGCTGCTACCAATACTGCTGTATTTGGCCCAAGACGTTATTGTACCGTGGAGTAGAAAAGCCAAGAAGTGGCAATTATATGGTGTGACAATGCTCATGTTAGTTGGTAATTTACTGCCGAGCTTGTACTTCGGTGTGATCCATCAAGCTGGGACGTTGGCGGTGATGCCGGTGTTAAGGGAATCCCTAACAGAGAACCGATCGTCTATATTATTCATGATGCCATGTCATTCAACACCTCTGTACAGGCATTGGTATATAACTCAAGGATTGCCAGCGATACTTGGTATAAATGTGTTGCCAGTTGTATGGGCTCTATATAAAATAATATCCCGTCCAAGTGAAAATAAGATGGGGTTATTGTTGATTACTGCTGCTGGCTTACATGTTTTATTACATAGTTTCATACCTCACAAAGAGTTTCGGTTTGTGTTGCCGCTGCTACCAATACTGCTGTATTTGGCCCAAGATGTTATTGTACCGTGGAGTAGAAAAGCCAAGAAGTGGCAATTATATGGTGTGACAATGCTCATGTTAGTTGGTAATTTACTGCCAAGCTTGTACTTCGGTGTGATCCATCAAGCTGGTACGTTGGCGGTGATGCCGGTGTTGAGGGAATCCCTAACAGAGAACCGATCGGCTATATTATTCATGATGCCATGTCATTCAACACCTCTGTACAGTCACTTACATCGCAACGTATCAACTCGTACTTTGGACTGTTCCCCTCCACCTGAAGGTCGGACGTGTGAGTCAGACGCGTTCTTCATGAATCCGAATCGTTGGTGGAACGCGGAGTACGCACACAGACAGACGCCCAGTCACCTAGTGTTATTCGACGTACTAAAGGGCAGGGTGGACTCGCTGTTACAGGGTTACGACTTAATACAAAGGATAAATCATACTCAGTTCCCACAAGGAGAAGTCGGAGAAAAGGTGTTAGTTTACAAGAAAAGGACTCAGAAGATACAAGCGGATGAAATAATACATTGA

Protein sequence:

>DPOGS200524-PA
MALAKGLRPVQVGAVILAVRILSVFLVQTWYVPDEYWQTLEVAHKYAFGYGALTWEWQKGIRSYLYPSVVAVLYSVLKFTGLDYPNVVIILPRILQAIISSIADYKFYKWTGNRKWALFLILTSWFWFYTASRTLLQTLETAFVAIALSVFPFKTGKLGYYEKESSTWLWLACVSVFVRPTSAPLWIVLGIYNMVTTNQGRIELLLKTYLPIAFICGVMLVGLDSYLYGRLIVTPWEFFKYNVLGGVASFYGEHPCFIPHKEFRFVLPLLPILLYLAQDVIVPWSRKAKKWQLYGVTMLMLVGNLLPSLYFGVIHQAGTLAVMPVLRESLTENRSSILFMMPCHSTPLYRHWYITQGLPAILGINVLPVVWALYKIISRPSENKMGLLLITAAGLHVLLHSFIPHKEFRFVLPLLPILLYLAQDVIVPWSRKAKKWQLYGVTMLMLVGNLLPSLYFGVIHQAGTLAVMPVLRESLTENRSAILFMMPCHSTPLYSHLHRNVSTRTLDCSPPPEGRTCESDAFFMNPNRWWNAEYAHRQTPSHLVLFDVLKGRVDSLLQGYDLIQRINHTQFPQGEVGEKVLVYKKRTQKIQADEIIH-
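
Checking the two records against each other:

The protein record appears to be a direct translation of the transcript record, with "-" marking the stop codon. The following Python sketch shows one way to check that relationship. It assumes the standard genetic code and assumes that the transcript holds only the coding sequence, ending in a stop codon; neither assumption is stated in the record. The names `translate` and `expected_cds_length` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# "*" marks a stop codon in this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol; the default "-" mirrors the
    trailing character of the protein record above (an assumption about
    what that character means). Codons containing anything other than
    A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length):
    """Nucleotides needed for protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)
```

Applied to the first 18 nucleotides of DPOGS200524-TA (ATGGCTTTGGCAAAGGGC), `translate` gives MALAKG, which matches the start of DPOGS200524-PA. `expected_cds_length(597)` gives 1794, which agrees with the transcript and protein lengths listed in the record header.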