Monarch geneset OGS2.0

DPOGS200861
Transcript: DPOGS200861-TA, 1140 bp
Protein: DPOGS200861-PA, 379 aa
Genomic position: DPSCF300071 + 296140-298594
RNAseq coverage: 233x (Rank: top 44%)
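
The lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the genomic coordinates are 1-based and inclusive and that the 1140 bp transcript is coding sequence only, including the stop codon; neither assumption is stated in the record, and the variable names are illustrative.

```python
# Values copied from the record above.
transcript_bp = 1140
protein_aa = 379
start, end = 296140, 298594

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

# Assumption: the transcript is coding sequence only, stop codon included.
codons = transcript_bp // 3
lengths_agree = (transcript_bp % 3 == 0) and (codons == protein_aa + 1)

# Genomic bases not accounted for by the transcript (for example, introns).
extra_genomic_bp = genomic_span - transcript_bp

print(genomic_span, codons, lengths_agree, extra_genomic_bp)
```

Under these assumptions the transcript length matches 379 residues plus one stop codon, and the genomic interval is longer than the transcript, which would be consistent with a multi-exon gene model.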
Annotation
Heliconius: HMEL012643 | 0.0 | 85.16%
Bombyx: BGIBMGA009849-TA | 5e-171 | 76.56%
Drosophila: Rtc1-PA | 8e-124 | 58.14%
EBI UniRef50: UniRef50_Q7Q8X5 | 7e-123 | 56.76% | AGAP010498-PA n=4 Tax=Culicidae RepID=Q7Q8X5_ANOGA
NCBI RefSeq: XP_967971.1 | 1e-126 | 58.84% | PREDICTED: similar to GA10780-PA [Tribolium castaneum]
NCBI nr blastp: gi|91079786 | 3e-125 | 58.84% | PREDICTED: similar to GA10780-PA [Tribolium castaneum]
NCBI nr blastx: gi|195448561 | 2e-122 | 57.81% | GK10124 [Drosophila willistoni]
Group
Gene Ontology:
  GO:0006396 | 7.1e-165 | RNA processing
  GO:0003824 | 4e-57 | catalytic activity
KEGG pathway 
InterPro domain:
  [1-379] | IPR000228 | 7.1e-165 | RNA 3'-terminal phosphate cyclase
  [1-379] | IPR016443 | 7.6e-157 | RNA 3'-terminal phosphate cyclase-like, eukaryotic
  [11-340] | IPR023797 | 1.5e-61 | RNA 3'-terminal phosphate cyclase domain
  [14-256] | IPR013792 | 4e-57 | RNA 3'-terminal phosphate cyclase/enolpyruvate transferase, alpha/beta
  [183-288] | IPR013796 | 1.6e-29 | RNA 3'-terminal phosphate cyclase, insert domain
Orthology group: MCL13374 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200861-TA
ATGCCTGCAATTTTACAAAATGATGTGCTAGTATATAAAGGCAGCAATTTATTCCGCCAAAGATTGCTCCTTGCGACGTTAAGTGGCAGAGCTATAAAGATTGAGGAAATTCGAAGTTCTTTTGATGATCCAGGCTTAAGAGAGTATGAAGTCAATTTAATACGTTTACTGGACAAAATAACAAACGGATCAAGAGTAGAATTAAGTGAAACTGGAACCTCTTTGTACTTTCAACCAGGAATTTTAATTGGTGGTCAAGTCACCCACAGTTGTTGCACACAGAGAGGTATAGAAGTATTGCTAGCGCTGGGACCATTCTGTAAAGAGCCGTTGAATGCTGTACTGCAGGGTGTGACTCATCATGAGTTGGATGTATCCGTTGACAAAGTAAAGTCTGCAGCATTACCAATATTGCTTAAGTTTATATTAGTCGATGACGGCTTGGAACTTAAAGTTGTCAGAAGAGGAGCTCCACCTTTGGGCGGTGGTGAAATAGTGTTCAAGTGTCCCGTCCGCCGTCATCTGCGGCCACTCCAGTGCAGTAAATGGGGAATGGTGAAGAGAATCAGAGGGGTCGTGTACGCCTTAAGAGTATCACCGACCATGGCCAATAGAGTTGTAGAGGCAGCTAAAGGGGTGATGTTAAACTTCCTCCCCGATGTTTATATAAACACAGACCAGTGTCGTGGAGCGAATGCCGGGAAGAGTCCTGGATTCGGAGTCAGTTTAGTCGCTGAGACTACTGATAAGACATTTTACTGTGCCGAAGCAAAGTCGGCCGAGGTTGGTTCCGGTGAGCAAACACTTCCCGAGGACTTGGGTCGGGAGTGCGCCCACATGTTGCTGGACGAGGTGCGCCGGGGCGGAGCAGTCGACAGTTCCTTCCAGTGGTTGTTAGCGCTCTGGATGGCGCTCGGACAGAAAGACGTCAGTGAGTGTGTTGTTGGACCCCTCTCTGATTACACAATCAAGTTTCTGCAACATCTCAAAGAGTTCTTCGGCGTGATGTTCAAGTTAGAAGTCCTGAGATCTGAAGACGATGAGAGCTCAGACGAGGAGGACAAGTTCGCGATAGCACAGAAGATTAAAATGACTTGTGTTGGAATAGGATATGTGAATATTAGTAAAAGGACTTTATAA

Protein sequence:

>DPOGS200861-PA
MPAILQNDVLVYKGSNLFRQRLLLATLSGRAIKIEEIRSSFDDPGLREYEVNLIRLLDKITNGSRVELSETGTSLYFQPGILIGGQVTHSCCTQRGIEVLLALGPFCKEPLNAVLQGVTHHELDVSVDKVKSAALPILLKFILVDDGLELKVVRRGAPPLGGGEIVFKCPVRRHLRPLQCSKWGMVKRIRGVVYALRVSPTMANRVVEAAKGVMLNFLPDVYINTDQCRGANAGKSPGFGVSLVAETTDKTFYCAEAKSAEVGSGEQTLPEDLGRECAHMLLDEVRRGGAVDSSFQWLLALWMALGQKDVSECVVGPLSDYTIKFLQHLKEFFGVMFKLEVLRSEDDESSDEEDKFAIAQKIKMTCVGIGYVNISKRTL-
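
The protein sequence corresponds to a direct translation of the nucleotide sequence, with the trailing "-" marking the stop codon. As a rough illustration, the sketch below translates the first eight and last four codons of DPOGS200861-TA. It assumes the standard genetic code and reading frame 1; the function name `translate` and its `stop_symbol` parameter are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt, stop_symbol="-"):
    """Translate a coding sequence in frame 1, writing stops as stop_symbol."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(nt), 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 24 and last 12 nucleotides of DPOGS200861-TA, copied from the record.
first = translate("ATGCCTGCAATTTTACAAAATGAT")
last = translate("AGGACTTTATAA")
print(first)  # MPAILQND
print(last)   # RTL-
```

Passing the full 1140 bp sequence to the same function should give the 379 residues followed by "-", provided the assumptions above hold.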