Monarch geneset OGS2.0

DPOGS215923
TranscriptDPOGS215923-TA1050 bp
ProteinDPOGS215923-PA349 aa
Genomic positionDPSCF300029 + 726706-728734
RNAseq coverage167x (Rank: top 51%)
Annotation
HeliconiusHMEL0078403e-14269.34% 
BombyxBGIBMGA000293-TA1e-10767.18% 
DrosophilaCG7488-PA7e-6940.67% 
EBI UniRef50UniRef50_D6WW147e-7547.35%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WW14_TRICA
NCBI RefSeqXP_966917.11e-7547.35%PREDICTED: similar to CG7488 CG7488-PA [Tribolium castaneum]
NCBI nr blastpgi|910884193e-7447.35%PREDICTED: similar to CG7488 CG7488-PA [Tribolium castaneum]
NCBI nr blastxgi|910884191e-7247.21%PREDICTED: similar to CG7488 CG7488-PA [Tribolium castaneum]
Group
Gene OntologyGO:00037232.5e-34RNA binding
GO:00056222.5e-34intracellular
GO:00055252.5e-34GTP binding
KEGG pathway 
InterPro domain[40-335] IPR0056622.5e-34GTP-binding protein Era
[38-167] IPR0052251.4e-18Small GTP-binding protein domain
[50-162] IPR0029179.4e-16GTP-binding domain, HSR1-related
[249-342] IPR0159461.4e-13K homology domain-like, alpha/beta
[41-61] IPR0060732.9e-06GTP1/OBG
Orthology groupMCL15353 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215923-TA
ATGTTCACACTTGTTTTGATACCATTGCGTTTTAAAAATCGGAGCCCTAACCTATTAAGAGCCCTTTATTCTTCACAACCACAACAGGAATCGAAGAAAGATATAGGCAAAGTAGTTAATGTGGCTATCGTAGGGGCTCCCAATGCTGGAAAGAGTACACTCATAAATAGAATTGTGGACAGAAAGATTTGTGCTGCTTCAAACAAAGTTCATACAACTACTAAAATGGTTAGAGCTATGTGTTATGATAAGGATACTCAAATTGTATTTTTAGACACACCGGGTGTTGTAAATACCAAGGAGCAAAAGAAGTACAAACTGCCGGAATCCATGCTTCAAGCTTGTCAGAAAAGTCTCAGATGTGCTGATGTAATAGGTGTGGTCCATGATGTATCAAACCGGTGGACAAAGAATTATTTAAATACAGATGTGGTTAAAATTGTTGAAATGGTGAAAGAAATTCCTAGCTTCTTAATTCTTAACAAGGTTGATGTTTTAAAGTCTAAAGCACAGTTATTAACAATCATTAGAAATCTTACTAATGGTATAATTGCTGGTAACCCTATACCAAATACAGATAAAAAAAGTAAAATAGAAAAAGGTTATAGTTATTTTTCTGATGTTTTCCTGGTTTCTGCCATGAATGGTGATGGTGTTGGGGATATTAAGCAATACTTGATCAACAACGCAAAAGTCCGGACATTACAATATTCTCCAACTGAATGGTCAGATCAGACTCCTGAGAGGTTAATAGAGGAAGCGGTTAGAGCGAAATTTTTAGATTTTCTAGCAAATGAAATACCATACAATTTGAAGATCCGTTTGGAGTATTACGATGAGATAGAGGAGGAAGACAGGATTGTATGTTCAGTAGCTGTGGAATGCCCTTCGGAGAGATTGGCTAGATTGATAAGCGGTGCGGGCGGAGGGAGGCTCCAACAGATCAAGTCGCACGTCAGAAATGATCTCATGGAAATGTTTAAAAAAACTGTCCTCATAGATCTTAAGCTGCACGTTAAAAGTAAACCGGAACTTGAGAATGTTGTATAG

Protein sequence:

>DPOGS215923-PA
MFTLVLIPLRFKNRSPNLLRALYSSQPQQESKKDIGKVVNVAIVGAPNAGKSTLINRIVDRKICAASNKVHTTTKMVRAMCYDKDTQIVFLDTPGVVNTKEQKKYKLPESMLQACQKSLRCADVIGVVHDVSNRWTKNYLNTDVVKIVEMVKEIPSFLILNKVDVLKSKAQLLTIIRNLTNGIIAGNPIPNTDKKSKIEKGYSYFSDVFLVSAMNGDGVGDIKQYLINNAKVRTLQYSPTEWSDQTPERLIEEAVRAKFLDFLANEIPYNLKIRLEYYDEIEEEDRIVCSVAVECPSERLARLISGAGGGRLQQIKSHVRNDLMEMFKKTVLIDLKLHVKSKPELENVV-