Monarch geneset OGS2.0

DPOGS202883
Transcript        DPOGS202883-TA   1077 bp
Protein           DPOGS202883-PA   358 aa
Genomic position  DPSCF300126 - 396967-402522
RNAseq coverage   231x (Rank: top 44%)
Annotation
Heliconius      HMEL014588         1e-31    81.25%
Bombyx          BGIBMGA004159-TA   3e-91    55.07%
Drosophila      CG4061-PA          9e-76    42.66%
EBI UniRef50    UniRef50_Q7Q2U7    1e-95    47.79%   AGAP004820-PA n=4 Tax=Culicidae RepID=Q7Q2U7_ANOGA
NCBI RefSeq     XP_971161.1        2e-102   53.62%   PREDICTED: similar to RNA 3 terminal phosphate cyclase [Tribolium castaneum]
NCBI nr blastp  gi|91087565        5e-101   53.62%   PREDICTED: similar to RNA 3 terminal phosphate cyclase [Tribolium castaneum]
NCBI nr blastx  gi|91087565        3e-97    53.94%   PREDICTED: similar to RNA 3 terminal phosphate cyclase [Tribolium castaneum]
Group
Gene Ontology   GO:0006396   1.2e-125   RNA processing
                GO:0003963   2.6e-112   RNA-3'-phosphate cyclase activity
                GO:0003824   1.5e-65    catalytic activity
                GO:0016886   1.7e-08    ligase activity, forming phosphoric ester bonds
KEGG pathway 
InterPro domain  [1-357]     IPR000228   1.2e-125   RNA 3'-terminal phosphate cyclase
                 [6-332]     IPR017770   2.6e-112   RNA 3'-terminal phosphate cyclase, subgroup
                 [277-340]   IPR023797   6.8e-93    RNA 3'-terminal phosphate cyclase domain
                 [4-183]     IPR013792   1.5e-65    RNA 3'-terminal phosphate cyclase/enolpyruvate transferase, alpha/beta
                 [184-283]   IPR013796   3.1e-14    RNA 3'-terminal phosphate cyclase, insert domain
                 [184-284]   IPR013791   1.7e-08    RNA 3'-terminal phosphate cyclase, subset, insert domain
Orthology group  MCL11790   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202883-TA
ATGGTTCACATGCAAGAAATAGATGGAAGCGTGCTGGAAGGGGGTGGGCAAATTCTTAGAATGTCCATATCACTGAGCGCTATACTGAATATACCAGTAAGAGTTACAAATATTCGAGCGGCACGCAAGAATCCAGGGTTAGCTGCTCAACATTTAAAAGGTATTCAGCTAGTTGCAAATATGTGCCAAGCCGAGCTTAAAGGAGCTTATATTGGATCAACAGAAATTGAGTTTACCCCCGGCAAAATCCGAGGAGGACATTATGTTGCTGACACACAAACAGCAGGTTCAATAAGTCTGCTTATCCAAGTGGCCCTGCCGTGTGCACTGTTTGCCGATGGCCCAACTATTATGGATTTGAAGGGCGGCACCAATGCGGACATGGCACCGCAGATTGATTACATGGATATGGTATTCCGGAAGATTCTTAACAAATTTGGTGCTGATTTCAACATGCAGATTCTGAGACGCGGCTACTTTCCACGTGGTGGAGGTCACGTTCGAGTTGAAATAACTCCGGTACATATGCTGCGTAGTATCAGTCTAATGGATAGGGGAGAGATTGGAGACATCGGTGGAATCAGTTTCGTGGCCGGAAACTTGCCAGTCAAGTTCGCCTATCAGATGGCAGATGGCGCTAAACATGAAATGGGATCAAATCACAGATTAAACATCAGAAGTTACAAAGAGGACAGGTCGATGGCCCCGGACAACTGTAATGGAATCGTTCTGTCATGTTCGACGCCGTCGTGTGTGTTGGGGGCGTGTGGCCTCGGGAGGCGCGGGGTGAGCCCCGGGGAGGTCGGGGGCTCCGCCGGCAGACTACTGAGACAGGTCATCGACTCGGGGGTCTGTGTGGACTCACACGCGCAGGATCAAGTGATACTGTATATGAGTCTAGCACCCGGCCTGTCCTCGGTCCGCTCGGGCTCCTCCACTCTGCACACACAGACGGCCATTCACATAGCGGAGACACTTGCTAAGGTTAAGTTCGAGATAACGTCAGAAGGCGGACAGGATATTATAAAGTGTGCTGGGATCGGTCTCGTTAACAAATCCCTTCCTGAAGAACAATGA

Protein sequence:

>DPOGS202883-PA
MVHMQEIDGSVLEGGGQILRMSISLSAILNIPVRVTNIRAARKNPGLAAQHLKGIQLVANMCQAELKGAYIGSTEIEFTPGKIRGGHYVADTQTAGSISLLIQVALPCALFADGPTIMDLKGGTNADMAPQIDYMDMVFRKILNKFGADFNMQILRRGYFPRGGGHVRVEITPVHMLRSISLMDRGEIGDIGGISFVAGNLPVKFAYQMADGAKHEMGSNHRLNIRSYKEDRSMAPDNCNGIVLSCSTPSCVLGACGLGRRGVSPGEVGGSAGRLLRQVIDSGVCVDSHAQDQVILYMSLAPGLSSVRSGSSTLHTQTAIHIAETLAKVKFEITSEGGQDIIKCAGIGLVNKSLPEEQ-
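
The transcript and protein entries can be cross-checked against each other: 358 residues plus one stop codon account for the 1077 bp listed for DPOGS202883-TA. The sketch below is one way to run that check in Python. It assumes the coding sequence follows the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `expected_cds_length` are hypothetical helpers introduced here for illustration, not part of any database tool.

```python
# Standard genetic code (NCBI translation table 1), built from the usual
# TCAG codon ordering. Assumption: the record's CDS uses this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


def expected_cds_length(protein_aa: int) -> int:
    """Nucleotides needed for protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


# Lengths taken from the record: 358 aa protein, 1077 bp transcript.
assert expected_cds_length(358) == 1077

# First 30 and last 18 nucleotides of DPOGS202883-TA, copied from the record.
first_30 = "ATGGTTCACATGCAAGAAATAGATGGAAGC"
last_18 = "CTTCCTGAAGAACAATGA"
print(translate(first_30))
print(translate(last_18))
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS202883-PA string extends the same check to the whole record.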