Monarch geneset OGS2.0

DPOGS207098
Transcript: DPOGS207098-TA, 1536 bp
Protein: DPOGS207098-PA, 511 aa
Genomic position: DPSCF300001 + 3042490-3048906
RNAseq coverage: 333x (Rank: top 35%)
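
As a quick sanity check on the fields above, the genomic span can be compared with the transcript length. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the record does not say); `span_length` is a hypothetical helper name, not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length of the closed interval [start, end].

    Assumption: coordinates are 1-based and inclusive.
    """
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


genomic_bp = span_length(3042490, 3048906)  # from the genomic position field
transcript_bp = 1536                        # from the transcript field
non_transcript_bp = genomic_bp - transcript_bp

print(genomic_bp, non_transcript_bp)
```

Under that assumption the locus spans 6417 bp, of which 4881 bp are not represented in the 1536 bp transcript. That gap would be consistent with intronic sequence, although the record itself does not list exon boundaries.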
Annotation
Heliconius      HMEL013289        3e-43  41.06%
Bombyx          BGIBMGA013061-TA  3e-35  55.30%
Drosophila      CG31120-PB        9e-52  28.12%
EBI UniRef50    UniRef50_E2BYK5   7e-77  38.20%  GPI transamidase component PIG-S n=7 Tax=Formicidae RepID=E2BYK5_HARSA
NCBI RefSeq     XP_397105.2       7e-90  37.66%  PREDICTED: similar to CG31120-PB, isoform B [Apis mellifera]
NCBI nr blastp  gi|332030039      4e-94  39.26%  GPI transamidase component PIG-S [Acromyrmex echinatior]
NCBI nr blastx  gi|332030039      2e-93  39.07%  GPI transamidase component PIG-S [Acromyrmex echinatior]
Group
KEGG pathway: ame:413664, 2e-89
  K05291 (PIGS) maps-> Glycosylphosphatidylinositol(GPI)-anchor biosynthesis
InterPro domain: [12-496] IPR019540, 9.8e-82, Phosphatidylinositol-glycan biosynthesis class S protein
Orthology group: MCL11888, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS207098-TA
ATGGCGGAGGCTAATCGTATGTGGGCCAGCGCGTCGTTTGTTGGTGTCCTGATCATTATAGGCTTGCCATTATGGTGGAAGACCACTGAAGTGTACAGGGTGGCTCTTCCCTACGATGAGATTAACGCTTTCGATACTGATTCCCACTTCATCACAACCGAGCTGGTGGTGCTAGCCAACGATGAGAAGACCGCTGCTAAGATGGCCGAGGATATTGCGAAGGCTTTCGAGCAGTCAGATATAATAAAACTAAAGATCAGGCACCAGAAGTTATCGGACAGTCTTCGTCAGACCCTGGACATGGTTGCGGATGAACAGGAGGCTCTGGAGGAGGTCGCAGCGGCGTTTGACGTCACGGGACACAATACCTTCCACGTGGTACAAAGAAGACACCTATTCCAGAATGTGTGGATTGGAGGAGAAAGGGTGGCATTCTTCAGAGACTTAAAAGCATCGGAAACATTAATCCGGGCGCTGTCATCGTGGGTGCTGCAACCGTTATTGCTGGAGGGCGCTCGTTCGTTAACTGCCGATGACGCACGACGCGTGCGGCTCCCTAGCGAGCAACACCTGGACCTGCTGATGACCGTGGTGCACCCGCGACCTCAAAAAATGAAGGTTGTGTTCGACGCGCCAGCTGTGCTAGAAGATTACGTCGGTTCATTTGTTGACGAGTTGTCATATTTACACAATGTCACCCTCAAGTCACAATGGCTGCACCTCACGGAGTTCGACTTCCAGGCGAAACAGGTGTTGAGTCCATCGGGTCCTCAGTGGTGGGTCAGCGTGGAGGCCGACGGCGACGAGGAATGGGAGCTGGACGGACGAGGGCTAGGAACTATTCGTCTTGCGGCGTACATACTGCCCTGTGACGTCACACCGCTTGTATTACACGACAGAGGAAGTCGAGTGGAGGCTGCAGTACCAGCTTTTATGTCTCCTAAATGGGGTGGTGTTGTATTACTGAACCCCTCCTGTGAGGAATGCTCCAAGGGTCAGTACGTGGTGCCCGTGAACATGCTCATGGGAACTTTCATCTCACAACTCAGGAAACTACTCGGTATCACTGATAAGGTTAATATTCCCGAGGCGTATCTGCTTCCTCTCGACTCCGTCTCACCGCGTCTTTGGGAATTGGATTCGCTCCTACGTCTGCGTACCCTGGAACAGGTCACTAGCGCACAGAGGAGCTTGAAGTCTCTAGCCAAACTTCTGGGCGAGATCCCGAATATCGTGATAACGGATAAGGTGGGAAGCTACATCCAGTCAGCTGTTCAACACGCACATGAAGCTCGCGTGTACTGCGAACGTGGGTCCCTGGTGGAGGCGCACGAGCACGCTCGCTCCGCCAACGCCGCCGCTGAGACCGCCTTCACGGAACCCAGCCTGCTGGCACTGTTATACTTCCCCGACGATCAGAAATACGCGATCTACATACCATTGTTCCTACCCATCATGTTTCCCGTGATACTGTCGTTGAAAAATCTTATTCTGTGGTTTAAGGGGAAACCGGTTGTTAAGGAGAAGACTGATTAA

Protein sequence:

>DPOGS207098-PA
MAEANRMWASASFVGVLIIIGLPLWWKTTEVYRVALPYDEINAFDTDSHFITTELVVLANDEKTAAKMAEDIAKAFEQSDIIKLKIRHQKLSDSLRQTLDMVADEQEALEEVAAAFDVTGHNTFHVVQRRHLFQNVWIGGERVAFFRDLKASETLIRALSSWVLQPLLLEGARSLTADDARRVRLPSEQHLDLLMTVVHPRPQKMKVVFDAPAVLEDYVGSFVDELSYLHNVTLKSQWLHLTEFDFQAKQVLSPSGPQWWVSVEADGDEEWELDGRGLGTIRLAAYILPCDVTPLVLHDRGSRVEAAVPAFMSPKWGGVVLLNPSCEECSKGQYVVPVNMLMGTFISQLRKLLGITDKVNIPEAYLLPLDSVSPRLWELDSLLRLRTLEQVTSAQRSLKSLAKLLGEIPNIVITDKVGSYIQSAVQHAHEARVYCERGSLVEAHEHARSANAAAETAFTEPSLLALLYFPDDQKYAIYIPLFLPIMFPVILSLKNLILWFKGKPVVKEKTD-
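
The transcript and protein records can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code and that the trailing `-` in the protein record marks the stop codon. `translate` is a hypothetical helper written for this illustration, and only the first eight and last four codons of DPOGS207098-TA are used, for brevity.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in TCAG order; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 24 nt and last 12 nt of DPOGS207098-TA, copied from the record above.
n_terminus = translate("ATGGCGGAGGCTAATCGTATGTGG")
c_terminus = translate("AAGACTGATTAA")

# A 1536 bp coding sequence holds 512 codons; one of them is the stop codon.
expected_residues = 1536 // 3 - 1

print(n_terminus, c_terminus, expected_residues)
```

The translated ends (`MAEANRMW` and `KTD-`) agree with the start and end of DPOGS207098-PA, and the codon count agrees with the stated length of 511 aa. Running the same function over the full nucleotide sequence would extend the check to every residue.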