Monarch geneset OGS2.0

DPOGS205411
Transcript: DPOGS205411-TA; 2394 bp
Protein: DPOGS205411-PA; 797 aa
Genomic position: DPSCF300407 + 220750-232851
RNAseq coverage: 370x (Rank: top 32%)
Annotation
Heliconius: HMEL007153; 0.0; 93.59%
Bombyx: BGIBMGA001579-TA; 0.0; 91.30%
Drosophila: Cbp80-PB; 0.0; 74.78%
EBI UniRef50: UniRef50_Q7K4N3; 0.0; 74.78%; Nuclear cap-binding protein subunit 1 n=44 Tax=Coelomata RepID=NCBP1_DROME
NCBI RefSeq: XP_973056.2; 0.0; 77.40%; PREDICTED: similar to AGAP001195-PA [Tribolium castaneum]
NCBI nr blastp: gi|189234539; 0.0; 77.40%; PREDICTED: similar to AGAP001195-PA [Tribolium castaneum]
NCBI nr blastx: gi|189234539; 0.0; 77.40%; PREDICTED: similar to AGAP001195-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0016070; 2.4e-113; RNA metabolic process
Gene Ontology: GO:0005488; 2e-98; binding
Gene Ontology: GO:0005515; 1.2e-36; protein binding
KEGG pathway: tca:661829; 0.0
KEGG pathway: K12882 (NCBP1, CBP80) maps-> Spliceosome
InterPro domain: [30-289]; IPR016021; 2.4e-113; MIF4-like, type 1/2/3
InterPro domain: [483-797]; IPR016024; 2e-98; Armadillo-type fold
InterPro domain: [487-759]; IPR015174; 1.5e-71; MIF4G-like, type 2
InterPro domain: [342-473]; IPR015172; 6.8e-56; MIF4G-like, type 1
InterPro domain: [30-242]; IPR003890; 1.2e-36; MIF4G-like, type 3
Orthology group: MCL14287; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205411-TA
ATGAATCGTAGAAGGGCACATGAAGAAGAAGACGGTTATGAACGTCTGCATAGAAAACGCCGGAGGGTTTCAGAAAATCAAGAAATTGAGGATAGATTAGAATCCCTTATTTTAAGAGTAGGAGAGAAAAGTAGTTCTAGCTTAGAGAGCAATTTAGAAGGCTTGGCGAGTGTGTTGGAAGCCGATTTGAGTACATTTAGAGTGAAAATATTACGTATTTTAACAGAATGTGCTATACGAATGCCAGAGAAATGTACTATTTATGCAACATTAGTTGGATTGCTAAACGCAAAAAATTATAACTTTGGTGGCGAATTTGTGGATTACATAGTCAAAACGTTTAAAGAAAATCTCAAAACTGGTAAATGGAACGCAGCCCGTTATTGCTTGAGGTTTATAGCTGATCTAGTTAACTGTCATGTGTTGGCTGCTTCATCCCTATTGACATTGTTAGAAACATTGGTTGACTGTGCTAATGAAGATGGTGTGCCACAAGTCCGACGTGATTGGTTTGTGTTTGCCGTACTTGCTACTTTGCCATGGGTTGGTAGAGAATTGTATGAAAAGAAAGAATCACAATTAGATCACTTGCTGGTAACCATAGAGGTGTTTTTAAACAAGAGAAGTAAAAAACATTGGCCAGCTCTAAAAGTATGGTCGGCAGACTCACCGCATCTTCAAGAAGAATATTTGGATTGTTTGTGGGCTCAAATCAAAAAGTTAAGACAGGATAACTGGTCAGAAAAACATATACCTAGGCCTTATCTTGCTTTCGATTCTATATTGTGTGAGGCCTTACAACACACACTACCTACAATTCAGCCTCCACCTCACAATGATGGCGACACATATCCAATGCCCCGAGTTATATTCCGTATGTTTGACTACACCGATTGTCCCGACGGCCCTGTACTTCCAGGTGCCCACTCCATAGAAAGGTTCCTTATAGAGGAACACCTCCATAATATCGTTGAAGCATACCATCTAGAGCGAAAGGAATGCGCCGCTCAACTCCTATGTTTTCCGTATAAATCTAAAATCCCACTAGAGTACTGTATCGTTGAAGTTATATTTGCTGAGCTGTTCAATTTGCCGAGACCGAGATATTTGGAGATATGTTATGGATCTATTCTAATTGAGCTCTGTAAGCTGCAACCGTCCACGATGCCGCAAGTTTTGGCACAAGCTACTGAGATTCTTTTCATGAGAATCGATACAATGAATATAGCATGTTTTGACAGATTAGTGAACTGGTTTTCGTATCACTTGAGCAACTTCCAATATCGTTGGTCCTGGGAAGATTGGGAGGGATGCGCTCAGCTAGACCCTGAACATCCTAAGCCGAGATTCATCAGGGAGGTGCTCGGCAAGTGTCTTAGGTTGTCATATCATCAAAGGATCAAAGACATGACACCGGAATCGCTAGCAGCATACGTCCCTCTCAAACCTGAACCCATTTACAAATATGCCATGGAAGGAGCGGCCTCTCTCCCAGGAACAGAAGCTGCTCATCAATTAGTTGTGTGCGTCCGTAATAAATGTACGCCCGAAGAGGCGTTGAACGTGCTCCGGGAACTACCGAACCCTTTACGCGAGGGTGAAGCCAACGCCGCTCATACAGCCTACAACCCGCTCAAGATTGACGTATTTGTCCAAACTCTGCTAAACCTCGGCAGCAAGAGCATCTCCCACAGCTTCGCCGCTATATCTAAGTTCCATTACGTTTTTAAGATCCTAGCGGAGTCTGAAGAAGCCCAGATCTGTGTTCTTCGCAACGTTTGGGAGTTATGGCAGAGGCACAGTCAGATGGTGTGCGTTCTGGTCGATAAAATGTTGAAAACTCAGATCGTTGAATGCAGCGCTGTCGCTACGTGGCTGTTCTCTAAGGAAATGGCGCCATACTTCACCCACGGATATCTATGGGAGATATTACATCTCACCATAGACAAGATGAACAAACACGTCTCCAAATTGAGCAAGGAATTACAAGAAGCAAGGGA
GGCGTTGGCGAGAGCCGATTCAAGCAGTTCCGAGTCAGAAGACGAGAGCGGCAGCAAGAAGAAGAAAGACCAAGACAAACCCACGGAAGAGGCGGTGGAGCGTATGGAGGAACGCCTGGAGATGGCGCACACGGACCAGAAGCGTCTGTTCCTGATAGTGTTCCAGAGGTTCATCATGATTCTGTCCGAACACCTGGTGAGAGCCGACACCGACGCGAGGGATCCCCACACCCACTGGTACAGGGCCACCCTCGCCCGGCTGCGACAGGTGTTCCTGCTTCATCACGAACAGGTCCAGAAGTACAGCAGTACGTTGGAAACGCTTCTGTTCACTCAAGACCTGGACCCACACATTCTGGACGTCTTCCATCAATTCGTAGCTCTCACCGCATAA

Protein sequence:

>DPOGS205411-PA
MNRRRAHEEEDGYERLHRKRRRVSENQEIEDRLESLILRVGEKSSSSLESNLEGLASVLEADLSTFRVKILRILTECAIRMPEKCTIYATLVGLLNAKNYNFGGEFVDYIVKTFKENLKTGKWNAARYCLRFIADLVNCHVLAASSLLTLLETLVDCANEDGVPQVRRDWFVFAVLATLPWVGRELYEKKESQLDHLLVTIEVFLNKRSKKHWPALKVWSADSPHLQEEYLDCLWAQIKKLRQDNWSEKHIPRPYLAFDSILCEALQHTLPTIQPPPHNDGDTYPMPRVIFRMFDYTDCPDGPVLPGAHSIERFLIEEHLHNIVEAYHLERKECAAQLLCFPYKSKIPLEYCIVEVIFAELFNLPRPRYLEICYGSILIELCKLQPSTMPQVLAQATEILFMRIDTMNIACFDRLVNWFSYHLSNFQYRWSWEDWEGCAQLDPEHPKPRFIREVLGKCLRLSYHQRIKDMTPESLAAYVPLKPEPIYKYAMEGAASLPGTEAAHQLVVCVRNKCTPEEALNVLRELPNPLREGEANAAHTAYNPLKIDVFVQTLLNLGSKSISHSFAAISKFHYVFKILAESEEAQICVLRNVWELWQRHSQMVCVLVDKMLKTQIVECSAVATWLFSKEMAPYFTHGYLWEILHLTIDKMNKHVSKLSKELQEAREALARADSSSSESEDESGSKKKKDQDKPTEEAVERMEERLEMAHTDQKRLFLIVFQRFIMILSEHLVRADTDARDPHTHWYRATLARLRQVFLLHHEQVQKYSSTLETLLFTQDLDPHILDVFHQFVALTA-