Monarch geneset OGS2.0

DPOGS208715
TranscriptDPOGS208715-TA3282 bp
ProteinDPOGS208715-PA1093 aa
Genomic positionDPSCF300043 - 58710-65595
RNAseq coverage3946x (Rank: top 3%)
Annotation
HeliconiusHMEL0152630.084.92% 
BombyxBGIBMGA003361-TA0.073.71% 
DrosophilaKarybeta3-PA0.058.30% 
EBI UniRef50UniRef50_Q9VN440.058.30%FI07923p n=37 Tax=Eumetazoa RepID=Q9VN44_DROME
NCBI RefSeqXP_001607590.10.063.25%PREDICTED: similar to importin beta-3 [Nasonia vitripennis]
NCBI nr blastpgi|1839793030.086.37%Karyopherin beta 3 [Papilio xuthus]
NCBI nr blastxgi|1839793030.086.37%Karyopherin beta 3 [Papilio xuthus]
Group
Gene OntologyGO:00054882.4e-139binding
KEGG pathway 
InterPro domain[811-936] IPR0119892.4e-139Armadillo-like helical
[6-938] IPR0160241.8e-129Armadillo-type fold
Orthology groupMCL11401 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208715-TA
ATGGCAGGAGACCAGGCTCAATTTTATCAATTATTAAATACAATATTGTCTATAGATAATGAAACCAGATCACAAGCAGAGAAATTATATAATGATATACCAACCGAAACGAAGGTAGTCCACTTAGTTGGTGCTATCCAAAATGCAGACCTCGGCGAGGAAGCCAGAGAAACCGCAGCAGTACTCCTGCGAAGGTTGCTCAGCGCCGAGTTTTTTGAATTCTTCCCTAAACTCCCTTTTGATCAGCAAGCAATGCTTAGAGAACAGTTGTTGCTCACTCTGCAAATGGATGTTAGTCAACAGCTAAGACGAAAAATTTGCGATGTTGTATCTGAGCTAGCAAGAAATCATATAGATGATGATGGAGTCAACCAGTGGCCGGAATTCCTTCAATTCATGTTCAACTGTGCGAGTGCGCAAGATCCTAACATTAAAGAGGCTGGGATTAGAATGTTCACATCTGTGCCAGGTGTTTTTGGTAATCGTCAGAATGAAAATCTGGATGTTATTAAGAGAATGTTGCTGTCAACACTCCAACCGACGGAATCAGAGGCACTGCAGATGCAGGCTGTTAAAGCCGTCGGAGCTTTTATCCTGCTCCATGATAAAGAACCAGCCATACAGAAACATTTTAGTGATCTCCTTGTACCTTTTATGCAGGTTGTAGTACAATCAATTGAGAAGGCCGATGATGATGCAGCCCTAAAAGTTCTTATTGAACTTGCTGAATCTGCTCCTAAATTCTTGAGACCACAAGTTCAAACAATCTTTCAAGTTTGCATAAAGGTTATTGGGGACAAAGATGGCGAGGACAACTGGCGGCAGTTGGCATTAGAAGCACTAGTGACATTATGTGAGACAGCCCCAGCAATGGTCAGGAAAGTAGTTCCTAATGCCATACAACTACTGACCCCACTCATACTAGATATGATGTGCGAGTTAGAAGAGGAACCTGATTGGGCTGTGCAGGATAATGCCTCTGATGATGACAATGAATTAAACTATGTCGCAGCAGAATCTGCCTTAGACCGTATGTGTTGTGGTCTTGGTGGCAAGATAATGCTTGGTCTGATTGTGGGACAAGTGCCTGAAATGTTGAACTCTCAAGACTGGAAGAGGAGGCATGCCGCCTTAATGGCAGTATCTTCAGCTGGGGAAGGCTGTCACAAACAAATGGAACAAATGTTAGATCAAGTTGTTTCAGCGGTGCTCAACTATCTAACAGACCCCCATCCGCGAGTGCGTTATGCTGCTTGCAATGCTGTGGGGCAAATGTCAACGGACTTTGCACCGGTGTTTGAGAAGAAATTCCATGACAAAGTGGTTCCCGGACTTCTTATGGTACTAGAAGACAATGCTCATCCAAGAGTACAGGCTCATGCAGCAGCAGCCCTGGTTAACTTCAGTGAAGATTGCCCCAAGCAAATCTTAACTCAATACTTGGGACCACTCATGGGTAAACTGGAAGCAATTTTAACTGCTAAATTCAAGGAGTTGGTAGAGAGTGGCACCAAGCTTGTACTAGAGCAGATAGTGACCACGATTGCTTCAGTTGCTGATACTGTGGAAAAAGAGTTTGTGGAATATTACGACAGGCTGATGCCCTGTCTTAAATATATCATTGCCAATGCCACTACTGACGAATTCAAAATGCTTCGCGGCAAGACTATTGAGTGTGTCAGTCTGATAGGCTTGGCTGTTGGTGAAGAGAAATTTATGGCGGATGCATCAGAAGTGATGGACATGCTTCTTAAGACACACTCGGAAGGTGATCAACTGCCGGCCGATGACCCCCAGACTTCTTATCTTATCTCTGCGTGGTCGCGAATTTGCAGAATTATGGGCAAAAAATTCGCTCAGTATTTACCGATGGTAATGGAGCCGGTGATGCGCACCGCTGCTATGAAGCCGGAAGTAGCTTTATTAGATAACGACGACCTCGAGATCATCGAGGGAGAACTTGACTGGCACTTTGTAACCTTAGGAGAGCAACAGAACTTCGGCATAAAGACTGCAGGACTTGAAGATAAAGCGTCCGCCTGCGACATGTTGGTCTGTTACGCCCGAGAATTGAAGGAGGCCTTCGCTGAATACGCCGAAGATGTTGTTAAGCTAATGGTGCCGATGTTGAAGTTTTACTTCCACGACAACGTGCGCACAGCCGCCGCAGAGTCTTTACCTTATCTACTGGAATGTGCGCGAATCCGCGGCCCGCAGTACATTCAGGGCATGTGGGCGTACATCCTGCCCGAATTATTGAAGGCCATCGAGTCCGATCCAGAACAAGATGTCCAAGTGGAACTGCTGAACAGCCTTGCTAAGTGCATCGAACTCTTGGGAACGGGATGTCTGTCTGACGAATCTATGTCTGAAGTGTTGCGCATCCTCAACAAACTACTCGCAGAGCATTTCGAGCGTGCTACTCAACGCAGACAAAGGCTCGCCGACGAAGATTACGATGAGGTTGTAGAAGAACAACTAGCGGATGAAGATAACGAAGATGTATATGGCCTGTCCCGAGTGGCCGACGTGTTACACGCGCTTATGTCCGCCTACCATGAAAACTTCTACCCTCACCTAGACTCGCTGGTTCCTTATCTTGTACAACTCCTTGGCCCCGGACGTCCATACGCCGACCGACAATGGGCTATTTGCATTTTCGACGATGTCATCGAATTCGGCGGTCCCGCCTGCGTGAAGTATCAAGATGTCTTCTTGGAGCCGATGTTGAATGGTCTACGTGAGCCTCAGCCGGAAGTGCGACAAGCCGCTGCCTACGGTTGTGGAGTGTTAGCCCAATTCGGGGGACCAAATTTTGCGGCCGCCTGCGCTCGAGCCGTTCCACTGCTAGCCGCCCTCATCGCCGAGCCAGACTCGCGTTCCGTGGAAAACCTGAACGCTACCGAAAACGCTATCTCTGCTGTTACGAAAATTATAAAATACAATCATTCGCAAATCAATAGGGATGAAATCATCAGGCACTGGTTGACCTGGCTCCCAGTGGTTGAGGACACAGAGGAGGCGCCACATGTATACTCGCTGCTGTGTGAACTGGCAGCCGGCGGCCACCCAGCTCTCGCTACACCGGACGCGCCTCGACACGTGATCGCTACACTAGCCGAAGCCTTCCTGAGAGACGCCGTGCCAAACGATAATCCTGTGTACGCGCAGATGGTCGCTCTCGTCAGGCAAATACAGACGAATGCGGAGCTTTTCAATTCTTGCTTGATGCAATTGAGTAACGACCACAAGGAAGCTCTCAAAGTGGCTCTGCTGACCTAG

Protein sequence:

>DPOGS208715-PA
MAGDQAQFYQLLNTILSIDNETRSQAEKLYNDIPTETKVVHLVGAIQNADLGEEARETAAVLLRRLLSAEFFEFFPKLPFDQQAMLREQLLLTLQMDVSQQLRRKICDVVSELARNHIDDDGVNQWPEFLQFMFNCASAQDPNIKEAGIRMFTSVPGVFGNRQNENLDVIKRMLLSTLQPTESEALQMQAVKAVGAFILLHDKEPAIQKHFSDLLVPFMQVVVQSIEKADDDAALKVLIELAESAPKFLRPQVQTIFQVCIKVIGDKDGEDNWRQLALEALVTLCETAPAMVRKVVPNAIQLLTPLILDMMCELEEEPDWAVQDNASDDDNELNYVAAESALDRMCCGLGGKIMLGLIVGQVPEMLNSQDWKRRHAALMAVSSAGEGCHKQMEQMLDQVVSAVLNYLTDPHPRVRYAACNAVGQMSTDFAPVFEKKFHDKVVPGLLMVLEDNAHPRVQAHAAAALVNFSEDCPKQILTQYLGPLMGKLEAILTAKFKELVESGTKLVLEQIVTTIASVADTVEKEFVEYYDRLMPCLKYIIANATTDEFKMLRGKTIECVSLIGLAVGEEKFMADASEVMDMLLKTHSEGDQLPADDPQTSYLISAWSRICRIMGKKFAQYLPMVMEPVMRTAAMKPEVALLDNDDLEIIEGELDWHFVTLGEQQNFGIKTAGLEDKASACDMLVCYARELKEAFAEYAEDVVKLMVPMLKFYFHDNVRTAAAESLPYLLECARIRGPQYIQGMWAYILPELLKAIESDPEQDVQVELLNSLAKCIELLGTGCLSDESMSEVLRILNKLLAEHFERATQRRQRLADEDYDEVVEEQLADEDNEDVYGLSRVADVLHALMSAYHENFYPHLDSLVPYLVQLLGPGRPYADRQWAICIFDDVIEFGGPACVKYQDVFLEPMLNGLREPQPEVRQAAAYGCGVLAQFGGPNFAAACARAVPLLAALIAEPDSRSVENLNATENAISAVTKIIKYNHSQINRDEIIRHWLTWLPVVEDTEEAPHVYSLLCELAAGGHPALATPDAPRHVIATLAEAFLRDAVPNDNPVYAQMVALVRQIQTNAELFNSCLMQLSNDHKEALKVALLT-