Monarch geneset OGS2.0

DPOGS203904
Transcript: DPOGS203904-TA (2904 bp)
Protein: DPOGS203904-PA (967 aa)
Genomic position: DPSCF300005 - 1100223-1106577
RNAseq coverage: 1647x (Rank: top 8%)
Annotation
Heliconius: HMEL017931; 0.0; 77.90%
Bombyx: BGIBMGA002013-TA; 0.0; 79.03%
Drosophila: msk-PA; 0.0; 54.38%
EBI UniRef50: UniRef50_E2B4U3; 0.0; 59.01%; Importin-7 n=13 Tax=Coelomata RepID=E2B4U3_HARSA
NCBI RefSeq: XP_001843364.1; 0.0; 54.14%; importin-7 [Culex quinquefasciatus]
NCBI nr blastp: gi|170030978; 0.0; 54.14%; importin-7 [Culex quinquefasciatus]
NCBI nr blastx: gi|380026689; 0.0; 55.78%; PREDICTED: importin-7 [Apis florea]
Group
Gene Ontology: GO:0005488; 4.4e-116; binding
    GO:0006886; 4.3e-15; intracellular protein transport
    GO:0008565; 4.3e-15; protein transporter activity
KEGG pathway:
InterPro domain: [1-932] IPR016024; 4.4e-116; Armadillo-type fold
    [649-772] IPR011989; 4.3e-58; Armadillo-like helical
    [22-101] IPR001494; 4.3e-15; Importin-beta, N-terminal
Orthology group: MCL11372 Multiple-copy universal gene

Nucleotide sequence:

>DPOGS203904-TA
ATGGACACCAGAAAGCTGACGGAAATTTTGCGAGCTACTATTGATCCTAACCAAAGGCAACAAGCAGAAGAACAGCTTTCCCAGATTCATAAAATAATTGGATTTGCTCCCTCTTTGCTGCAAGTGGTTATGCTTAATGATGTTACAATCCCTGTCCGTCAAGCTGGAGTTGTATATCTGAAGAATTTGGTAACCTCGGGTTGGTTAGAGAAAGAGGCAGAAGATGGAGAGCCAATACCATTTAGTATTCATGAACAGGACAGAGCCATGATAAGAGATATCATTGTGGATGCCATAGTACAGGCACCTGATATAATCAAAGTTCAGCTCTGTGCTGATCAATCTCCAGAATCTGTTACCATACAAAAACAAATACTTAAGTGTTTCTATGGTCTCATAAAGTTTAATTTACCACTGGGTTTAATCACAAAAGAAATATTTACGAAATGGATGGAAGTTCTCCGTTCTGTAATGGAGCAGCCTGTCCCAGAACACACTTTGCAGGTTGATGAGGATGAACGCATGGAACTTCCTTGGTGGAAATGCAAAAAATGGGCAGTTCATACCCTTTATCGTTTATTTGAAAGATATGGAAGCCCAGTAAATGTTCGAGATGAATATGTTCAGTTTGCAGAGTGGTACTTAACTACTTTCACTGGTGGGATATTAGAGGTTCTTCTCAGAGTTCTGGATCAATACAGAAACAAAATCTATGTGTCACCACGTGTACTGCAGCAGACCATTAGCTATATTGATCAGTGCATAAGTCATGCACATTCTTGGAAATTACTGAAGCCTCACATGTTTGCTATTATTAAGGATGTGCTATTTCCACTTATGTCCTACTCTGAAGCTGATGAGGAGTTATGGTTCTCTGATCCCCATGAATATATACGTATCAAATTTGATATATTCGAAGATTTTGTTTCTCCCGTAACAGCAGCACAGACATTACTCATATCATGTTGTAAAAAGAGGAAAGACATGTTGGAAGAAACTATGCATTTATGTATGCAAGTCCTCAGAAACCAGAATGGTGAATATGGACCCCGCCAAAAAGATGGTGCTTTACACATGGTGGGAACACTTGTTGATATCCTTATAAAGAAAAAGTTTTACAATGAAGAAATTGACCCACTCCTCAGCGAATTTGTTATTCCTGAGTTTCATAGTCAGTTGGGTTACATGAGGGCCAGAGCTTGCTGGGTGCTTCACTGTTTCTCAAGTATTCGTTTCAAGAGTGAAATGCTATTGGTAGAAGTTGTCCGTCTCACAGTTAATGCTTTCCTTAATGATACTGAGCTGCCTGTGAAAGTTGAGGCAGCTATAGCCATACAAATGCTACTGACATCACAAAACAAAGTTCATAAATTACTAGAACCTCAAGTTAAAGCTGTCACCACAGAGCTTCTAAATGTAATTCGTGAAACAGAAAATGACAACATTGCTAATGTTCTGCAAAAGATTGTACCTTTGTACACAGAACAATTAATGCCAATGGCCTATGAGATCACAGATCACTTAGCAACTACATTTAGTAAAGTTATTGAGACAGATTCTGGAACTGATGAAAAGGCTATCACAGCCATGGGTCTTCTAAACACTATGCAAGCAGTACTCACTGTTATGGAAGATAACCCAGAAATTATGTTGCAGCTAGAAAGTACAGTTCTTCGTGTAGTTGGACACATTTTACATCACAATATTATAGAATATTATGAAGAAGCTATGACTTTATTGTGTAATTTAACTGCAAAATCAATATCCAAAGATTTATGGACAGTTCTTGAGATGTTGTACCAGGTGTTTGAAAAAGAAGGTTTTGACTACTTCACAGACATGATGCCAGTATTGCACAATTATATCACAGTTGACACAAATGCTTTTCTCTCCAATGAAAATCATATATTAGCTATGTTCAATATGGTTAAAGTGATCCTAAATTCTGATGCTGAAGATGAATCTGAGATATATGCAGCAAAACTTCTAGAAGTAATAGT
CTTACAATGTTCTGGTAAAATAGATAATTGTTTACCATCATTTGTGGAACTAGTTCTTAGCAGACTTACAAGGAAAGTTAAGACTTCAGAACTTAGAACAATGCTACTGCAGGTGCTCATAGCGATTTTATACTGCAACCCTCACCTCCTGTTTACGATTCTGGAAAAACTCCAAGAGTCTGTGCCGAATGCCTCAATTACCCAACACTTTATAAAACAATGGATACACGACACAGACTGTTTTATGGGTTTGCACGACCGGAAGCTGTGTGTGTTGGGCATTTGTACTTTACTAGAGATGGGCCCCCAGAGACCCAATCTAGACGAGGTAATACCAAAATTATTGTCCTCGTGTTTGGTGCTGTTTGATGGTCTCAAGAGAGCATACGAAGCAAGAGCCGAAGCTGACGAAGACACGTCGTCTGAAGAAAATGATGAGGAAGAAGATGAAGAAGTTCTTTCAAGTGACGATGACGACGTTGATCAAATGACAAACGAATATCTTGAAAATCTCGCACGAATGGCGACAAAGAATAGCAGCCAACAGGGCGTCAACCTTACTGCGAAAATAGAAGAATATGAGAGTGACGATGATGATGAAAGCTATGAGCCAGATGAGACTGCTATTGAATGTTACACAACACCACTTGATGAGAAAGACTGTACCGTGGATGAATACATAAAATTCAAAAACACATTATCTGCACTATCAACCAATGAACCTACATTGTACCATGCTCTGACAAGTGTTCTGACTGAGGAACAGAGGAAACAATTACATGCAGTTTTTGTACTAGCTGATCAACGCAAGGCACAACAAGATAGTAAACGGATTGAACAAAGTGGCGGTTACTCATTTACGGTTCCCGCACAAGTACCTACAACATTTAAATTTGGTTCATAA

Protein sequence:

>DPOGS203904-PA
MDTRKLTEILRATIDPNQRQQAEEQLSQIHKIIGFAPSLLQVVMLNDVTIPVRQAGVVYLKNLVTSGWLEKEAEDGEPIPFSIHEQDRAMIRDIIVDAIVQAPDIIKVQLCADQSPESVTIQKQILKCFYGLIKFNLPLGLITKEIFTKWMEVLRSVMEQPVPEHTLQVDEDERMELPWWKCKKWAVHTLYRLFERYGSPVNVRDEYVQFAEWYLTTFTGGILEVLLRVLDQYRNKIYVSPRVLQQTISYIDQCISHAHSWKLLKPHMFAIIKDVLFPLMSYSEADEELWFSDPHEYIRIKFDIFEDFVSPVTAAQTLLISCCKKRKDMLEETMHLCMQVLRNQNGEYGPRQKDGALHMVGTLVDILIKKKFYNEEIDPLLSEFVIPEFHSQLGYMRARACWVLHCFSSIRFKSEMLLVEVVRLTVNAFLNDTELPVKVEAAIAIQMLLTSQNKVHKLLEPQVKAVTTELLNVIRETENDNIANVLQKIVPLYTEQLMPMAYEITDHLATTFSKVIETDSGTDEKAITAMGLLNTMQAVLTVMEDNPEIMLQLESTVLRVVGHILHHNIIEYYEEAMTLLCNLTAKSISKDLWTVLEMLYQVFEKEGFDYFTDMMPVLHNYITVDTNAFLSNENHILAMFNMVKVILNSDAEDESEIYAAKLLEVIVLQCSGKIDNCLPSFVELVLSRLTRKVKTSELRTMLLQVLIAILYCNPHLLFTILEKLQESVPNASITQHFIKQWIHDTDCFMGLHDRKLCVLGICTLLEMGPQRPNLDEVIPKLLSSCLVLFDGLKRAYEARAEADEDTSSEENDEEEDEEVLSSDDDDVDQMTNEYLENLARMATKNSSQQGVNLTAKIEEYESDDDDESYEPDETAIECYTTPLDEKDCTVDEYIKFKNTLSALSTNEPTLYHALTSVLTEEQRKQLHAVFVLADQRKAQQDSKRIEQSGGYSFTVPAQVPTTFKFGS-
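
The transcript and protein lengths listed in this record are consistent with each other: 967 codons plus one stop codon give 967 x 3 + 3 = 2904 bp. The following is a minimal sketch, not part of the original record, of how that relationship could be checked by translating the coding sequence with the standard genetic code. The function name `translate` and the choice of `-` as the stop symbol are assumptions made for illustration; the `-` follows the trailing character of the protein sequence above, which presumably marks the stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence (hypothetical helper).

    Stop codons are written as `stop_symbol` so the output resembles the
    protein sequence in this record (an assumption about its notation).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nucleotides and last 12 nucleotides of DPOGS203904-TA.
start = translate("ATGGACACCAGAAAGCTGACGGAAATTTTG")
end = translate("TTTGGTTCATAA")
print(start, end)
```

Applied to the full 2904 bp sequence (with the line break inside the nucleotide record removed), the same function should reproduce the 967-residue protein followed by the stop symbol.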