Monarch geneset OGS2.0

DPOGS206710
Transcript        DPOGS206710-TA   1062 bp
Protein           DPOGS206710-PA   353 aa
Genomic position  DPSCF300495 - 45715-48641
RNAseq coverage   647x (Rank: top 20%)
Annotation
Database        Hit               E-value  Identity  Description
Heliconius      HMEL011003        4e-20    41.74%
Bombyx          BGIBMGA013653-TA  2e-70    42.42%
Drosophila      CG3033-PA         7e-14    30.30%
EBI UniRef50    UniRef50_D6WQR2   5e-41    31.05%    Putative uncharacterized protein n=2 Tax=Coelomata RepID=D6WQR2_TRICA
NCBI RefSeq     XP_975265.1       9e-42    31.05%    PREDICTED: similar to AGAP010738-PA [Tribolium castaneum]
NCBI nr blastp  gi|91087137       2e-40    31.05%    PREDICTED: similar to AGAP010738-PA [Tribolium castaneum]
NCBI nr blastx  gi|91087137       6e-40    31.05%    PREDICTED: similar to AGAP010738-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0042765  1.6e-16  GPI-anchor transamidase complex
                 GO:0016021  1.6e-16  integral to membrane
KEGG pathway     tca:664158  2e-41
                 K05289 (GAA1) maps-> Glycosylphosphatidylinositol (GPI)-anchor biosynthesis
InterPro domain  [120-347] IPR007246  1.6e-16  Gaa1-like, GPI transamidase component
Orthology group  MCL12765  Single-copy universal gene

Nucleotide sequence:

>DPOGS206710-TA
ATGCTGGCTGCTACTAACAGGTTTATATCTATTGGTCAGTACATGCCATCACTGTGTCTTCTATGTGGTGCTATGCTCATACGAGCTCTATCACTGTGGGTGACGTTACAGAAGGATGATGAAAAAGATGAAGCTATCGGTGATACAAAAGTAGCGGAAAAAGATAAAATAGGAAATAATGTTAATGATAAAGCAGAAGGTGACGACGAAAAAATTGCTAGAGAAACAGATGCTAAGGAGTTAACAAAAGAAGAAACTCAAGAGATTACAAAAGATATAGATAAAGATAAGTCCGAAAGTGTTATGAGGGGGACGGAAGATAAGGTCGCAAACATTACCAAAGATAAATATATAGGAAAAGAACAGACAGAGAATAACGCAGAAGAGAAAGAGGAACTTAAAAATAGTCGAGTGAACGGCTTTAGTATAGCCAATGTAGGTGGTAACTATTTGCTAGTGCATCTCATGGGATATACAGTCATGAATTTACCACCGCTCTTCACTTATATTGGTGCCATACACTTCTCTCTAGCGTCCGAGGTGTCCGTGTTTTATGGGATGCTGTCCTCGTCAGCTATATTCATACTATTATCGCCAAAGTGTTTGAGAACCCCGTCCGAGCTGACGCGCGATGAGGTGACCGTAGTCAACATACTGATGTTAATAGAGCTGTCCACCGCGTGCCTCGCTATCGGTGTCCATAACTTTCCGTTGGGAGTCTGCTTGGCCGCTCTGTACACGCCGCTGGCGTTAATCGTCGGGGTTGTGGAAGATAAGAAACAAGGAAGACTCATCCTGTTCGTTAAGCGGGTTGTCTGTTTGTTGCTGCAGCCTTTACTCATTCTGATGATTCTGATGATTCTATATTCGCGGGTTCTGTTCCCCGAGGAAGGTATTTTCTTGATGGCGAGTCGTGGTAAGGACGCCGCTATGCAAGCCATCATGTTCTCTATAGTGGATTCAATGATTTACGGCAACTGGTTGTTCAACATTGTGTGCACGGTCATTTTACCGACTTGGATATTGTCGTGGCAGATTTTGTGGAACAGAGTCCAAGTCTAA

Protein sequence:

>DPOGS206710-PA
MLAATNRFISIGQYMPSLCLLCGAMLIRALSLWVTLQKDDEKDEAIGDTKVAEKDKIGNNVNDKAEGDDEKIARETDAKELTKEETQEITKDIDKDKSESVMRGTEDKVANITKDKYIGKEQTENNAEEKEELKNSRVNGFSIANVGGNYLLVHLMGYTVMNLPPLFTYIGAIHFSLASEVSVFYGMLSSSAIFILLSPKCLRTPSELTRDEVTVVNILMLIELSTACLAIGVHNFPLGVCLAALYTPLALIVGVVEDKKQGRLILFVKRVVCLLLQPLLILMILMILYSRVLFPEEGIFLMASRGKDAAMQAIMFSIVDSMIYGNWLFNIVCTVILPTWILSWQILWNRVQV-