Monarch geneset OGS2.0

DPOGS211907
TranscriptDPOGS211907-TA1263 bp
ProteinDPOGS211907-PA420 aa
Genomic positionDPSCF300011 - 123479-124741
RNAseq coverage119x (Rank: top 58%)
Annotation
HeliconiusHMEL0177130.089.76% 
BombyxBGIBMGA001102-TA0.081.19% 
DrosophilaCG6401-PA3e-13150.21% 
EBI UniRef50UniRef50_Q179Y52e-15260.00%Glycosyltransferase n=13 Tax=Coelomata RepID=Q179Y5_AEDAE
NCBI RefSeqXP_397085.29e-17367.39%PREDICTED: similar to phosphatidylinositol glycan, class A [Apis mellifera]
NCBI nr blastpgi|3454848523e-17267.78%PREDICTED: N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein-like [Nasonia vitripennis]
NCBI nr blastxgi|665145283e-16967.39%PREDICTED: n-acetylglucosaminyl-phosphatidylinositol biosynthetic protein-like [Apis mellifera]
Group
Gene OntologyGO:00065062.5e-40GPI anchor biosynthetic process
GO:00090581.4e-23biosynthetic process
KEGG pathwayame:4136433e-172 
 K03857 (PIGA, GPI3)maps-> Glycosylphosphatidylinositol(GPI)-anchor biosynthesis
InterPro domain[36-125] IPR0132342.5e-40PIGA, GPI anchor biosynthesis
[189-333] IPR0012961.4e-23Glycosyl transferase, family 1
Orthology groupMCL14377 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211907-TA
ATGGCCTCCGATTTCTTCTTTCCAAACACAGGTGGGGTCGAAGAACATATTTATAATTTATCGCAGTGTCTGATAAAGCGGGGTCACAAAGTAGTCATAATAACCCACTGTTATGGGAAACGAGTTGGTGTCAGATATCTCACTAGGGGGCTAAAGGTTTATTATTTACCAATTACTGTATTTTACTCTCAATGTGCTCTACCAACTATGATTTGCAACATAGCTATTGTCCGTTATATTTTAATAAGAGAATCAATCGAAATTGTTCATGGACATTCGGCTTTCAGTGTTTTAGCACACGAAGTCTCTATTATTGGTAAATTGATGGGCTTGAAAACTGTTTTTACAGACCACAGTTTATTTGGTTTTGCCGATATGTCTGCAGTTTTGACAAATAAATATCTACAAATGTGTTTGTGTGAATGCGATCATTGCATATGTGTTTCACATACTGGTAAAGAGAATACTGTGTTAAGGGCCAAAGTGAAGGCTTTCAAGGTGTCTGTAATACCGAATGCTGTAGATGCATATACATTTACACCTGATCCTAGCCAAAGAGATCCTAATGTTGTGACAATTGTGATTATATCTAGATTAGTATATAGAAAAGGAGTTGATTTAATGGCATCCGTGATAGCAGAAATATGTCCGAGATACCCAAACATAAGATTTATTATTGGAGGAGATGGTCCCAAAATGTGGTTGTTACAAGAGGTCAGGGAACAAAAAGGACTCCAACATTGCGTTACACTGCTTGGTAGTTTAAAACATTCCGAAGTTAGAAATGTACTAGTCAAGGGAGATATATTTTTAAATACTTCTTTGACTGAGGCATATTGTATGGCTATAGTTGAAGCTGCTTCATGTGGTCTTAAGGTGGTGTCAACTAAAGTTGGAGGAATACCTGAAGTATTGCCACTGAGCATGATCTATCTAACAGAACCAAATGTACCAAGCATAATCAGCGGATTAGAATCTGCAATGAAAGACTTAAAGGACGGAAACTCCCTATGTCCATACAAATGTAATAAAATGGTAAGAAGCATGTACAATTGGATGGACATAACGAAAAGAACTGAAATTGTGTATAACAGAATATTGTCTAACAAAAATAAACCATTGGGGCTGCAATTAAAGAGCTATCTTAGTTGCGGAGTTTGGCCTTTTCTCCTAGTGATAAGCCTAATGTATTTATTACTGCAACTAGCTGACAGAATATATAAGAGGAAACATATCGACATAGCAAAAGACTTAAAAGTATAA

Protein sequence:

>DPOGS211907-PA
MASDFFFPNTGGVEEHIYNLSQCLIKRGHKVVIITHCYGKRVGVRYLTRGLKVYYLPITVFYSQCALPTMICNIAIVRYILIRESIEIVHGHSAFSVLAHEVSIIGKLMGLKTVFTDHSLFGFADMSAVLTNKYLQMCLCECDHCICVSHTGKENTVLRAKVKAFKVSVIPNAVDAYTFTPDPSQRDPNVVTIVIISRLVYRKGVDLMASVIAEICPRYPNIRFIIGGDGPKMWLLQEVREQKGLQHCVTLLGSLKHSEVRNVLVKGDIFLNTSLTEAYCMAIVEAASCGLKVVSTKVGGIPEVLPLSMIYLTEPNVPSIISGLESAMKDLKDGNSLCPYKCNKMVRSMYNWMDITKRTEIVYNRILSNKNKPLGLQLKSYLSCGVWPFLLVISLMYLLLQLADRIYKRKHIDIAKDLKV-