Monarch geneset OGS2.0

DPOGS204565
Transcript: DPOGS204565-TA (3114 bp)
Protein: DPOGS204565-PA (1037 aa)
Genomic position: DPSCF300300 - 235080-240127
RNAseq coverage: 955x (Rank: top 13%)
Annotation
Heliconius:      HMEL008384        0.0  96.57%
Bombyx:          BGIBMGA001540-TA  0.0  95.27%
Drosophila:      Upf1-PA           0.0  76.62%
EBI UniRef50:    UniRef50_Q9VYS3   0.0  76.62%  Regulator of nonsense transcripts 1 homolog n=40 Tax=Eukaryota RepID=RENT1_DROME
NCBI RefSeq:     XP_001604124.1    0.0  76.19%  PREDICTED: similar to nonsense-mediated mrna decay protein 1 (rent1) [Nasonia vitripennis]
NCBI nr blastp:  gi|345491348      0.0  75.93%  PREDICTED: regulator of nonsense transcripts 1-like isoform 3 [Nasonia vitripennis]
NCBI nr blastx:  gi|345491348      0.0  75.84%  PREDICTED: regulator of nonsense transcripts 1-like isoform 3 [Nasonia vitripennis]
Group
Gene Ontology:
    GO:0003677  1.5e-73  DNA binding
    GO:0005524  1.5e-73  ATP binding
    GO:0008270  1.5e-73  zinc ion binding
    GO:0000184  1.5e-73  nuclear-transcribed mRNA catabolic process, nonsense-mediated decay
    GO:0004386  1.5e-73  helicase activity
    GO:0005737  1.5e-73  cytoplasm
    GO:0016787  4.5e-06  hydrolase activity
KEGG pathway 
InterPro domain:
    [101-252]  IPR018999  1.5e-73  RNA helicase UPF1, UPF2-interacting domain
    [464-529]  IPR006935  4.5e-06  UvrABC complex, subunit B
Orthology group: MCL14304 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204565-TA
ATGAGTGTCGACGCGTATGGCCCAAGTTCTCAAACACTCACATTCTTAGAAACCGAAGAAGCAGATCTAATCGGTGCTGACACACAAGGATCTGAGTTTGAATTTACAGACTTTACATTACCATCTCAGAGCCAAACACAAGCATCACAACACGATCATGGAATAACTTCATCCAACCAAATTAATGGACTCGGTCGTGGAGACCTTCATTCGAAAGTATCAAGTGTAGCTAATGCAATTGCGGAACTACAATTCGAAGAGGAGGATGAAGCCTTGTATAGTAGCAAGGAACTGCCAGAACATGCTTGCAAGTATTGTGGCATTCACGATCCCGCCACAGTTGTTATGTGTAACATCTGCAACAAGTGGTTTTGCAATGGGCGCGGAAACACTTCTGGATCTCATATCATTAATCATCTAGTGAGAGCTAAACATAAAGAAGCAGCACTTCATAGAGATGGCCCATTGGGAGAAACATTATTGGAATGCTACTCATGTGGGGCTCGTAATGTTTTTGTTCTCGGTTTTATACCAGCAAAAGCCGACTCAGTTGTTGTCCTTCTGTGCCGACAGCCATGTGCCGCCCAAAGCTCTCTTAAGGATATGAACTGGGATCAAGAGCAATGGAAGCCACTAATTTCTGACCGTGCATTTCTGTCTTGGCTTGTAAAAGTGCCTTCTGAAGCTGAACAAATGAGGGCAAGACAAGTGACTCCTCAACAAATTGGACGTCTTGAAGAACTATGGCGTGATAATGTCGATGCTACTTTCCAAGATTTAGAAAAACCGGGTGTAGACGAGGAGCCCCATCAAGTACTCCTGAGATATGAGGATGGATATCAATATCAGAATATATTTGGTCCTCTCGTTAAACTAGAAGCTGATTACGACAAGAGGCTCAAAGAGTCACAAACCCAGGAAGGCATAGAGGTGCGTTGGGATGTGGGTCTCAATAAGAAAACTATTGCATATTTCACCCTGGCCAAAACAGATAGTGACATGAAACTTATGCATGGAGACGAACTGAGATTGAGATATGTTGGTGAGCTACATAAAGCATGGTCTGGTGTTGGCCATGTCATTAAAGTTCCTGATAATTATGGTGACGACGTCGGTTTAGAACTGAAGAGTGGGGCCGGAGCACCCCTTGAATGTACTTCCAACTTTGTTGTTGATTTTATATGGAAGAGTACATCATTTGACAGAATGCAACTAGCTCTACGTAAATTTGCAGTAGACGATTCCTCAGTCTCTGGGTACATCTATCGTCGTCTGCTAGGTCATGAGGTAGAAGAGGTATTGTTCCGCGTACACCTGCCGAAACACTTCAGCGCACCGAACTTACCCGATCTTAACAGATCTCAGGTGTATGCAGTCAAGCACGCACTCCAACGTCCATTGTCTCTGATCCAAGGTCCTCCGGGTACTGGGAAAACCGTTACATCTGCGACCATTGTATACCAGCTCGTACGCCAAAACGGTGGTCCTGTACTCGTATGCGCTCCGTCCAACACTGCCGTAGACCAACTGACTGAGAAAATACATCGAACCGGTCTGAAAGTCGTTCGTCTCTGTGCTAAATCCAGGGAGGCTATGGAATCTTCAGTTTCCTTCTTGGCCTTACACGAACAGGCACGGGCCTTGGGCTCCGCTGATAGTGAACTTCGCAAGTTAACTAGGCTGAAGGAGGAGGCTGGTGAATTGTCTGCGGCTGATGAGAGGAGGTACCGTGCGCTCCGTAGAGCGGCCGAGAGAAGATTGCTTGACGCGGCCGATGTCGTATGTACTACCTGCGTCGGTGCTGGCGATCCCAGGGTTGCACGGATGAGGTTCCAGTCCATCCTCATCGATGAAGGCATGCAGTCTACGGAACCTGAGTGTATGGTGCCCGTAGTGCTTGGAGCGAGGCAATTAATCCTCGTCGGTGACCATTGTCAGTTAGGTCCAGTGGTTATGTGCAAAAAAGCCGCCAAAGCCGGTCTCAGTCAGAGTCTTTTTGA
ACGGCTCGTAGTTCTAGGCATTCGCCCCTTCCGCTTAGAAGTGCAATATCGTATGCACCCAGAGCTCTCCCGCTTTCCGTCAGACTTCTTTTACGAAGGATCACTTCAGAATGGAGTAAGTGCGGAGGAGAGACGATTGCACAAAATCGATTTCCCATGGCCAAGACCCGATAGGCCTATGTTCTTTTACGTTACTCAGGGTCAAGAGGAAATAGCTGGATCGGGAACATCGTACCTAAATCGAACGGAAGCCGCTAATGTTGAAAAGTTGACGACTCGCTTCTTGAAAGCTGGTGTTCGTCCAGAACAAATCGGGATCATCACTCCGTACGAGGGTCAAAGGTCATACCTCGTTCAGCATATGCAGTATCAAGGCAGTCTGCACGCTAAGCTATATCAAGAGATCGAAGTCGCCAGTGTGGACGCTTTCCAGGGCCGGGAAAAAGATATCATAATAATGTCCTGCGTCCGGTCCAACGAACATCAAGGAATCGGGTTTTTGAGCGATCCGCGTCGCTTGAACGTGGCATTAACACGCGCCAAGTACGGCTTAATTGTGGTCGGGAATCCGAAAGTTCTCAGCAAACAGCCGCTGTGGAACCACCTGCTAGCCTTCTACAAGGAGCGACGTGTGCTAACAGAGGGACCTTTGTCTAATCTGAAAGAGTCGGCGATACAGTTCGCAAAGCCGAAGAAGTTGGTGAACGCTCAGAATCCTGGCTCGCATTTCATGTCGACGTCGATGTTCGACGCTCGCGAGGCGATGGTCCCGGGATCCGTGTACGATCGTGCCCGTCCTCCACGCGACCCGCTCGCCTACGTCGGCCACGAGCACGCGGCGTCGCTTCACGCTCCCGTCCCGCCGGCAGCTTTCGCCGCTCACCGTCCGCAGCAACGCGCCCCGCCAGACGCGACGCGTTCCCGCCGTCGGCCGCCGCGTCTCTCACAGGAGCCGTTGTCTCAACAGCCGCCGCTGTCACTGTCACAGGGAGCGTCGCAGCCGGACTTCAGCCAGGAATCGTCCGCCCCGGACTGCCCGTCGCAGCCGGACGGGTTGCTGTCCCAGGACTCCACGTACCAGGGAGGGTTCCGCGCGCGCTGTGCCCAGTACTGA

Protein sequence:

>DPOGS204565-PA
MSVDAYGPSSQTLTFLETEEADLIGADTQGSEFEFTDFTLPSQSQTQASQHDHGITSSNQINGLGRGDLHSKVSSVANAIAELQFEEEDEALYSSKELPEHACKYCGIHDPATVVMCNICNKWFCNGRGNTSGSHIINHLVRAKHKEAALHRDGPLGETLLECYSCGARNVFVLGFIPAKADSVVVLLCRQPCAAQSSLKDMNWDQEQWKPLISDRAFLSWLVKVPSEAEQMRARQVTPQQIGRLEELWRDNVDATFQDLEKPGVDEEPHQVLLRYEDGYQYQNIFGPLVKLEADYDKRLKESQTQEGIEVRWDVGLNKKTIAYFTLAKTDSDMKLMHGDELRLRYVGELHKAWSGVGHVIKVPDNYGDDVGLELKSGAGAPLECTSNFVVDFIWKSTSFDRMQLALRKFAVDDSSVSGYIYRRLLGHEVEEVLFRVHLPKHFSAPNLPDLNRSQVYAVKHALQRPLSLIQGPPGTGKTVTSATIVYQLVRQNGGPVLVCAPSNTAVDQLTEKIHRTGLKVVRLCAKSREAMESSVSFLALHEQARALGSADSELRKLTRLKEEAGELSAADERRYRALRRAAERRLLDAADVVCTTCVGAGDPRVARMRFQSILIDEGMQSTEPECMVPVVLGARQLILVGDHCQLGPVVMCKKAAKAGLSQSLFERLVVLGIRPFRLEVQYRMHPELSRFPSDFFYEGSLQNGVSAEERRLHKIDFPWPRPDRPMFFYVTQGQEEIAGSGTSYLNRTEAANVEKLTTRFLKAGVRPEQIGIITPYEGQRSYLVQHMQYQGSLHAKLYQEIEVASVDAFQGREKDIIIMSCVRSNEHQGIGFLSDPRRLNVALTRAKYGLIVVGNPKVLSKQPLWNHLLAFYKERRVLTEGPLSNLKESAIQFAKPKKLVNAQNPGSHFMSTSMFDAREAMVPGSVYDRARPPRDPLAYVGHEHAASLHAPVPPAAFAAHRPQQRAPPDATRSRRRPPRLSQEPLSQQPPLSLSQGASQPDFSQESSAPDCPSQPDGLLSQDSTYQGGFRARCAQY-
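
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and assuming the trailing "-" in the protein record marks the stop codon. The function name translate and its stop_symbol parameter are illustrative and are not part of the database. Whitespace is stripped first, so a sequence wrapped over several lines is treated as one continuous string.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, stopping at the first stop codon.

    The stop codon is written as stop_symbol, mirroring the trailing
    "-" seen in the protein record (an assumption about its meaning).
    """
    cds = "".join(cds.split()).upper()  # join wrapped lines, normalise case
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            protein.append(stop_symbol)
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last two codons of DPOGS204565-TA, copied from the record.
print(translate("ATGAGTGTCGACGCGTATGGCCCAAGTTCT"))  # MSVDAYGPSS
print(translate("TACTGA"))                          # Y-

# Length check: 1037 residues plus one stop codon account for 3114 bp.
print(3 * (1037 + 1) == 3114)                       # True
```

Passing the full nucleotide sequence (both wrapped lines) to translate should give a string directly comparable to the DPOGS204565-PA record, including the final "-".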