Monarch geneset OGS2.0

DPOGS214951
Transcript: DPOGS214951-TA, 1854 bp
Protein: DPOGS214951-PA, 617 aa
Genomic position: DPSCF300280 + 94490-96343
RNAseq coverage: 294x (Rank: top 38%)
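The lengths and coordinates above can be cross-checked with simple arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive and that the gene is a single exon; neither is stated in the record, and the helper name `span_length` is hypothetical.

```python
# Assumption: coordinates are 1-based and inclusive (not stated in the record).
def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive coordinate range."""
    return end - start + 1

genomic_span = span_length(94490, 96343)  # scaffold DPSCF300280, + strand
transcript_bp = 1854                      # transcript DPOGS214951-TA
protein_aa = 617                          # protein DPOGS214951-PA

# A coding sequence with a terminal stop codon has one more codon than residues.
codons = transcript_bp // 3

print(genomic_span == transcript_bp)  # True: the span equals the transcript length
print(codons - 1 == protein_aa)       # True: 618 codons = 617 residues + stop
```

Under these assumptions the genomic span equals the transcript length, which is what an intronless gene model would show.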
Annotation
Heliconius      HMEL015589        0.0    66.88%
Bombyx          BGIBMGA004848-TA  0.0    58.28%
Drosophila      CG14749-PA        2e-63  34.34%
EBI UniRef50    UniRef50_D6WQJ6   1e-86  33.65%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WQJ6_TRICA
NCBI RefSeq     XP_974896.1       2e-87  33.65%  PREDICTED: similar to CG14749 CG14749-PA [Tribolium castaneum]
NCBI nr blastp  gi|91087075       5e-86  33.65%  PREDICTED: similar to CG14749 CG14749-PA [Tribolium castaneum]
NCBI nr blastx  gi|157125970      2e-88  31.08%  hypothetical protein AaeL_AAEL010355 [Aedes aegypti]
Group
Gene Ontology    GO:0016973  8.6e-85  poly(A)+ mRNA export from nucleus
                 GO:0005643  8.6e-85  nuclear pore
KEGG pathway     rno:299219  3e-53
                 K06063 (SNW1, SKIIP, SKIP) maps-> Spliceosome
                                                   Notch signaling pathway
InterPro domain  [243-617] IPR012476  8.6e-85  GLE1-like
Orthology group  MCL12161  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214951-TA
ATGCGAATATCCGCTCTTACTAAAGCTGCTGAAATCAGTCCACTAGTTACAGAAGTTACTATTGGCCCAAGAAGTCCAAAGCAGAAAGAAAAGTCTAAAATTAACCAAGAATTATCTCCAGATACTAAAGAAATGTTTAATCTATTAGAAGACAAATTAGGCGAGGATCTCCGATACACATTAATACTGAAAAAGTATGAAAACGAATTAAAAGAAACATCTGAAAACCTTGTTAGGAACCTTATTGAAAATATGGTCACCACTTATTCAGAAACCATGAAGAAATATGTAAAGAAACATAATGAAGAATGTAAACGAAAACTGTTGGACCTTAGAGCCAAAAAACTTCAAATGGCAAAACTTTTAAGAGACAATGATAATGTTTTAGTGTTAGAAAAAGCCAAGCTAGATGAGAAGCAATGTGAAATGATAAACCAACAAACATTAGACAATATGAATAGAATGTTAGAAGAACAGAAAAAAGCTGCAATTGTTGATAGTCATTTAAAAATAAGCATGCACTACAATGAAATATCAAATGTTGCTAATTCAGAACCACTAGCTAATGAAATTTTAGAAAAGTATATACAGTCAATAAATACAGTTATTGCAAATATTAATTCATTGATGACTCTATGTAAAACGGGACCCATAACTGATAAAGAAGTAAAGCAATCTGAAGTACTGGCGGCTAACCTCGAAAATTTAAAAAACAAGTTTCTTGATGAAGTAGAAAAATTTAAAGAGGAAATAAGGTTTAAAAAACAAAAGGAACAAGAATTATTATTAGAAAAACAAAAAGAAGAACAGGTTAAACAGAAACAATTAGAAGAAAAACAAGCTCAAGAATTAAAAGCAGCCGAAGAAGCTAAAATTCAACAAGCTGTTCTTGACCAAGCTAAAAGGGCACAGTCTATGTTTTATTCTGAGAAAAATTATACCTACTATATGGAGTTAAGGGACTTTCTTGACAAGTATGAAAATTCATACAAAGATTTACTACAAGACAATAATTGTAAAAAGTTCAGATTTGATTGCCAGAAAGCAGTGAACACACCTGTTAATGCACTTTCATCCGTGAGTGGTGTACACATGAGAGACAAATTTGATAAACTGTCAAAACTACTTAAAGGGGAAAGAGTTCAGGTTCTGGACACATTTGTCACAGCCACTCAGCATCCACAAGGATTACCATATTGCACAGCTCTATTAGCAAAGAAGATTGTGAGGCAGGGAGATCTTTTGGTGTCAAGTAATCCTGATGCTGCTTTTCCTTTGGCTGCTGTAACTGTTGCTTTGTGGTCACAGTTCCCGGAATTTGGTAAACTACTTGAAGCTTACTTCCATAGATTCTGTCCCTATTTAGTGCCAATGCTTCTCCCTCAGAAGGAAGGTCAAACTGACAAAGAGTTTTATATATCTCGAGGATATACATATAATGATGAAGATGTTGTTGAAAAACAGGATAAATTCTTGAGAAGAATGTCGGGCATATTCAGGCTCAGATGTGCTATGTGGATAACAAGTACACCCAGATTCCTGAATGTTTCCAATCCAAATGGTATGGGCTTCTCTTGGAGATGGCTGGCTTCTTTTATCAATCTTAAGCCGGAACCAGACATAAGTGCAACTTTGCTGCATGACTTTTTCATAGTTTGCGGTTCACAGTTCTTAAAGTTGTATGGCAAACAGTGTGTAAAAATAATAAAATTGTTAAGTACAGAATATTTAAGTATATTACAAAATATTGATGAAGGTGGACCCAAGACTAGATTAGAAGTTTTTCTACAGAATGTCATAAAATCAGGTCACATAGAACCTCCAAGTGGAATACTTCCACCCAATATTTGGTAA

Protein sequence:

>DPOGS214951-PA
MRISALTKAAEISPLVTEVTIGPRSPKQKEKSKINQELSPDTKEMFNLLEDKLGEDLRYTLILKKYENELKETSENLVRNLIENMVTTYSETMKKYVKKHNEECKRKLLDLRAKKLQMAKLLRDNDNVLVLEKAKLDEKQCEMINQQTLDNMNRMLEEQKKAAIVDSHLKISMHYNEISNVANSEPLANEILEKYIQSINTVIANINSLMTLCKTGPITDKEVKQSEVLAANLENLKNKFLDEVEKFKEEIRFKKQKEQELLLEKQKEEQVKQKQLEEKQAQELKAAEEAKIQQAVLDQAKRAQSMFYSEKNYTYYMELRDFLDKYENSYKDLLQDNNCKKFRFDCQKAVNTPVNALSSVSGVHMRDKFDKLSKLLKGERVQVLDTFVTATQHPQGLPYCTALLAKKIVRQGDLLVSSNPDAAFPLAAVTVALWSQFPEFGKLLEAYFHRFCPYLVPMLLPQKEGQTDKEFYISRGYTYNDEDVVEKQDKFLRRMSGIFRLRCAMWITSTPRFLNVSNPNGMGFSWRWLASFINLKPEPDISATLLHDFFIVCGSQFLKLYGKQCVKIIKLLSTEYLSILQNIDEGGPKTRLEVFLQNVIKSGHIEPPSGILPPNIW-
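The protein record should be the translation of the transcript record. A minimal sketch of that check follows, assuming the standard genetic code and assuming the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)

# First four and last three codons of DPOGS214951-TA, copied from the record.
print(translate("ATGCGAATATCC"))  # MRIS
print(translate("ATTTGGTAA"))     # IW-
```

Passing the full DPOGS214951-TA sequence to `translate` and comparing the result with DPOGS214951-PA would extend the same check to the whole record.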