Monarch geneset OGS2.0

DPOGS208508
TranscriptDPOGS208508-TA1821 bp
ProteinDPOGS208508-PA606 aa
Genomic positionDPSCF300064 - 338494-342989
RNAseq coverage321x (Rank: top 36%)
Annotation
HeliconiusHMEL0114930.064.71% 
BombyxBGIBMGA008447-TA0.073.46% 
Drosophilasbr-PA4e-12639.72% 
EBI UniRef50UniRef50_D6WXF58e-16047.15%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WXF5_TRICA
NCBI RefSeqXP_968226.11e-16047.15%PREDICTED: similar to small bristles CG1664-PA [Tribolium castaneum]
NCBI nr blastpgi|910890113e-15947.15%PREDICTED: similar to small bristles CG1664-PA [Tribolium castaneum]
NCBI nr blastxgi|910890111e-15447.15%PREDICTED: similar to small bristles CG1664-PA [Tribolium castaneum]
Group
Gene OntologyGO:00056343.7e-23nucleus
GO:00510283.7e-23mRNA transport
GO:00055153.8e-20protein binding
GO:00001663.5e-18nucleotide binding
GO:00068101.8e-08transport
GO:00056221.8e-08intracellular
GO:00064069e-06mRNA export from nucleus
GO:00054879e-06nucleocytoplasmic transporter activity
GO:00057379e-06cytoplasm
KEGG pathway 
InterPro domain[537-606] IPR0056373.7e-23TAP, C-terminal
[538-606] IPR0090603.8e-20UBA-like
[104-172] IPR0126773.5e-18Nucleotide-binding, alpha-beta plait
[362-512] IPR0020751.8e-08Nuclear transport factor 2
[99-167] IPR0152459e-06Tap, RNA-binding
Orthology groupMCL10561 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208508-TA
ATGCCGAAACGTGGAGGAAGAATTCGCAATTGGAAACCCAATGAACATTTTGAACATGATGATCGAGCTCACAGCAACAATACAAGACGTGTCAGCTTCAAATCCGGTTCATATAAGGGGAAAAGCAAATTTCGATCCTGGAACGATGCGTCCCAACTGCTGCTCGATGGTGATATTGACATGGGCGCATCTGGAGGTTACCCTATAAGGAAGGGCTTCAGAAATCAAAGAGGTCGGTTAAGCTCTCCAGCACCCCGGACCGCCAAAAGAAAATTTATACCTGGACCTTTGCCGTGGTATCAGATTGTAATACCATACGGAGCTAAACATGATAAAGACGTTATTTTACGATCACTATTAGGCTTTATATCGCCAGAAGTGTTCATACCACATTATTACAAAATAAATGGAAATACAGCTGTTTTCTATGTTGATGATGTTAAAATTGCTGAAAAACTATATGAGGCTGATAGAAAAATTGTTATGTCCGATGGGTTTAAACTTCTTATACTTGTAAGGAACAGTGTACCAAATGTAAATATTGATGCCGATATGAAAGAGAAAATGAAACTCGTCATGGCAAAAAGATACAATGCAGCTACTAAAGCCTTGGACTTAACCAAATTTCATGCCGACCCAGATTTAACAGATATATTCTGTGCACTTTTCCGACCCATCATAATGTCTGTTGCTATTGATATAATGGCAGACAATATACCGGATTTAGAAGCACTCAACTTAAATGAAAACAAAATCCATGGAATGGAACACATGAAGGTCCTTTGTACAAAGTTGAAGAATTTGAAGATTCTTTATTTGGGAGACAACAGGATTACAACTCTGGCTGCTCTAGAACCTTTGAAACCTTTGCCATTGGTGGAACTTTATCTTAAAGGAAACCCACTAGTTTCTAGGTTCAACGATCATGATATTTACGTCAGCGACGTGCGGAAAAAATTCCCCAAACTCATCAGATTGGATGGCGTAGACCTGCCACCAGCCATCGGCTTCGACGTATCTGAAGATTTATCCCTCCCTTCCCGTCAACAATCCTTCCTTATAGACCCAGCCGGCCAGAATCTCGTTAGAGAATTCCTTACACAGTACTTCGCAATCTACGATTCCGATTCCCGTCAGCCATTATTAGAGGCATACCACGAAACAGCTACTATGTCAATGGCGGCCGGCTATTTAAGTAATGAAGGACGAAATGTACCAGGAAATAAATTAAACGCCTACATATCTAACAGTCGTAATATTATGAGGATAACCGACAGGGAATCCCGTCGACGTTACCTCCGTACAGGAAGACTACAAGTTGTTTCGTTTCTGTCAGACTTACCCAAGACCAATCATGACCTGATGGGTTTTGCTGTTGATCTACTTGTTTTCACTCCAGCGATGATAGTGCTAACAATGAATGGTGTCTATAGAGAGACAACAGCATATGGCAATCCGACTCGGTCATTCCATAGGACTTTTGTTATTATACCAAATGCCACTGGTGGATTTTCAATTACTAATGACATGTTGTTTGTCAGTAATACTACTAAGGAGCAGGAGGATAAGTCATTTTCTGGCGGTGAAGTAGCTCCTTCAAGTTCAACTTCAGCACCTCCTGTGGCAACATTAGCTGCAACCCCATCATATGATGAGAATCAACGAATGATGCTTAACATGCTCTGCCAACAAACCGGCATGAACGAACATTGGAGCGTTAACTGTCTACAAGAAACTGGTTGGGACTATCAAAGAGCGTTGTTTATATTCAATCAACTGCAGTCGGAAGGCAAGATTCCACCGGACGCTTTTGTTAAGTGA

Protein sequence:

>DPOGS208508-PA
MPKRGGRIRNWKPNEHFEHDDRAHSNNTRRVSFKSGSYKGKSKFRSWNDASQLLLDGDIDMGASGGYPIRKGFRNQRGRLSSPAPRTAKRKFIPGPLPWYQIVIPYGAKHDKDVILRSLLGFISPEVFIPHYYKINGNTAVFYVDDVKIAEKLYEADRKIVMSDGFKLLILVRNSVPNVNIDADMKEKMKLVMAKRYNAATKALDLTKFHADPDLTDIFCALFRPIIMSVAIDIMADNIPDLEALNLNENKIHGMEHMKVLCTKLKNLKILYLGDNRITTLAALEPLKPLPLVELYLKGNPLVSRFNDHDIYVSDVRKKFPKLIRLDGVDLPPAIGFDVSEDLSLPSRQQSFLIDPAGQNLVREFLTQYFAIYDSDSRQPLLEAYHETATMSMAAGYLSNEGRNVPGNKLNAYISNSRNIMRITDRESRRRYLRTGRLQVVSFLSDLPKTNHDLMGFAVDLLVFTPAMIVLTMNGVYRETTAYGNPTRSFHRTFVIIPNATGGFSITNDMLFVSNTTKEQEDKSFSGGEVAPSSSTSAPPVATLAATPSYDENQRMMLNMLCQQTGMNEHWSVNCLQETGWDYQRALFIFNQLQSEGKIPPDAFVK-