Monarch geneset OGS2.0

DPOGS204327
Transcript: DPOGS204327-TA; 897 bp
Protein: DPOGS204327-PA; 298 aa
Genomic position: DPSCF300142 - 117974-119240
RNAseq coverage: 55x (Rank: top 69%)
Annotation
Heliconius: HMEL002322; 2e-65; 45.63%
Bombyx: (none listed)
Drosophila: Gpi1-PB; 8e-13; 37.62%
EBI UniRef50: UniRef50_F4X5R7; 1e-32; 42.93%; Phosphatidylinositol N-acetylglucosaminyltransferase subunit Q n=6 Tax=Formicidae RepID=F4X5R7_ACREC
NCBI RefSeq: XP_001120160.1; 2e-33; 40.00%; PREDICTED: similar to phosphatidylinositol glycan, class Q [Apis mellifera]
NCBI nr blastp: gi|332017434; 3e-32; 42.93%; Phosphatidylinositol N-acetylglucosaminyltransferase subunit Q [Acromyrmex echinatior]
NCBI nr blastx: gi|383865615; 6e-35; 46.75%; PREDICTED: phosphatidylinositol N-acetylglucosaminyltransferase subunit Q-like [Megachile rotundata]
Group: (none listed)
Gene Ontology: GO:0006506; 9.8e-26; GPI anchor biosynthetic process
 GO:0017176; 9.8e-26; phosphatidylinositol N-acetylglucosaminyltransferase activity
 GO:0016021; 9.8e-26; integral to membrane
KEGG pathway: phu:Phum_PHUM429660; 3e-27
 K03860 (PIGQ, GPI1) maps-> Glycosylphosphatidylinositol (GPI)-anchor biosynthesis
InterPro domain: [171-297] IPR007720; 9.8e-26; N-acetylglucosaminyl transferase component
Orthology group: MCL17120; Patchy

Nucleotide sequence:

>DPOGS204327-TA
ATGTTAAAGGGTTTTATTAAGAGAGATGAAAATACAATAACAGTTTTTATAACTACATTTTCTAACCAAAACATATTAGAAGGCTGCTTAAATAATCCAAACATTATTTATGGATGTAATATCGGGAATCGATCCCGAAATAAGCTTAGACGATATCAAAGTAATTTTACGGTAAATTTAGACACTGATCCCATAACTGTTACTAGTTTAGTAATTGATGGTCAAATACAAAATCTGAAAGAAAACTGCATCGTAATGTTATATGAATACCAAAAAATGAAAAAGTCGGAGACAATTAGCAGCAGTGACGTATGTTTCTCAACATTACAAAATCTTATTCACAAGGATAGTGAAAGTACCAATGATGAACTTTTGCCCGAGGTGCAATTTCAAAGCTCTTCCTTTTTAGGATCATGTATGTTTGCACAACATGTCTATAATTACTTTAATTTGTTAAAATGGCTTTTACATACAGTAAAGAAAAAGGAAAAGATAACAATAAAACAGAGAAACCTAATGTTGGCGATTGTTGGAGATATAATTTTAGGGTATTTGACAGTTGAACTATTAATGTTTGATAAGAAATACCTTGGAACACTAATCTTGGGGTTGTTAGAGAAATTAGTGAATCTTATGTATTCTTTATTGAAATGGCTAATGGGTGCTCCTGCTGGATTGAAATTAAACAATGCATTTAACAAAATGTTAGGAAAATTTTTCTTATATCATGTTGAGTTATGGTGGTTGTTTTTAGGCAATGACAAATTAGATATCATATTAAGTGTGTGCCAATATTTGGGTTACTTGGGATTCACATTTCAGGCAGCCATTATATCTGATATGATTTGTATGGCGACATTTCACTCGTATTGTATTTATATCTATGCGGCAAGGTGA

Protein sequence:

>DPOGS204327-PA
MLKGFIKRDENTITVFITTFSNQNILEGCLNNPNIIYGCNIGNRSRNKLRRYQSNFTVNLDTDPITVTSLVIDGQIQNLKENCIVMLYEYQKMKKSETISSSDVCFSTLQNLIHKDSESTNDELLPEVQFQSSSFLGSCMFAQHVYNYFNLLKWLLHTVKKKEKITIKQRNLMLAIVGDIILGYLTVELLMFDKKYLGTLILGLLEKLVNLMYSLLKWLMGAPAGLKLNNAFNKMLGKFFLYHVELWWLFLGNDKLDIILSVCQYLGYLGFTFQAAIISDMICMATFHSYCIYIYAAR-
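
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein lengths listed above can be cross-checked against each other: 897 bp is 299 codons, i.e. 298 amino acids plus one stop codon. The Python sketch below shows one way to do that check and to translate a coding sequence, assuming the standard genetic code (NCBI translation table 1) and assuming the transcript is a complete CDS starting at position 1. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration. The sketch writes the stop codon as `*`, whereas this record marks it with a trailing `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a complete CDS: codons minus the stop codon."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1


# First 30 and last 12 nucleotides of DPOGS204327-TA, copied from the record.
print(translate("ATGTTAAAGGGTTTTATTAAGAGAGATGAA"))
print(translate("GCGGCAAGGTGA"))
print(expected_protein_length(897))
```

Passing the full DPOGS204327-TA sequence to `translate` and replacing the final `*` with `-` should give a string directly comparable to DPOGS204327-PA.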