Monarch geneset OGS2.0

DPOGS210133
TranscriptDPOGS210133-TA1539 bp
ProteinDPOGS210133-PA512 aa
Genomic positionDPSCF300261 - 82868-86486
RNAseq coverage4120x (Rank: top 3%)
Annotation
HeliconiusHMEL0116056e-15772.51% 
BombyxBGIBMGA003758-TA0.086.24% 
DrosophilaKap-alpha3-PD0.074.04% 
EBI UniRef50UniRef50_O005050.070.33%Importin subunit alpha-3 n=200 Tax=Metazoa RepID=IMA3_HUMAN
NCBI RefSeqNP_001040340.10.086.05%karyopherin alpha 3 [Bombyx mori]
NCBI nr blastpgi|1140533230.086.05%karyopherin alpha 3 [Bombyx mori]
NCBI nr blastxgi|1140533230.086.05%karyopherin alpha 3 [Bombyx mori]
Group
Gene OntologyGO:00054888.7e-122binding
GO:00068861.3e-20intracellular protein transport
GO:00056431.3e-20nuclear pore
GO:00056341.3e-20nucleus
GO:00085651.3e-20protein transporter activity
GO:00066061.3e-20protein import into nucleus
GO:00057371.3e-20cytoplasm
GO:00055151.2e-11protein binding
KEGG pathwaypic:PICST_745201e-13 
 K08332 (VAC8)maps-> Regulation of autophagy
InterPro domain[19-487] IPR0160248.7e-122Armadillo-type fold
[18-486] IPR0119892.2e-116Armadillo-like helical
[7-91] IPR0026521.3e-20Importin-alpha-like, importin-beta-binding domain
[144-181] IPR0002251.2e-11Armadillo
Orthology groupMCL11403 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210133-TA
ATGGCGACGGATCCAATGAAACACCGCATGCATGTTTTCAAGAACACTGGCAAAGATGTCGATGAAATGCGTAGACGTAGGAACGAAGTGACAGTGGAACTTAGAAAGAACAAAAGAGAAGAAACATTACAAAAACGTCGCAATGTCCCCGTTAGCTATTCAACAGACGAAGATGATATCGATAGAACACTGGCATCCACGGATCTCGAGGAGCTGGTGATGAAAGCGGCCAATGCCGAGAACCCAGAAGAGCAGTTAGCCGCAGTTCAACAGTGTAGAAAGCTTCTGTCCTCCGATAAGAATCCACCTATAGACAGTTTGATAACGACGGGAATTTTACCTATACTTGTTCAGTGCCTTTCTAGAACTGATAACCCGACACTACAGTTCGAGGCGGCTTGGGCCCTCACGAATATCGCGTCAGGAACGTCCGCGCAAACCAACAAAGTCGTTCACGCAGGAGCAGTGCCTCTATTTTTACAATTGCTCATGTCTCCTCATGAGAATGTCTGTGAGCAAGCTGTCTGGGCTCTGGGTAACATCATCGGAGACGGACCAGTCCTGAGAGATTATGTTGTTGAATTAGGAGTTGTAAAGCCCTTACTCAGCTTTATAAAGCCTGGCATCCCCATCACTTTCCTGCGTAATGTTACATGGGTCATTGTCAACCTCTGTAGGAGTAAGGATCCTCCCCCGCCCGTCAAGACCATTCAGGAAATATTGCCAGCACTTAATGAACTTATCACTCACACAGATGTAAACGTGTTAGTGGACACAGTATGGGCTATCAGTTATCTCACTGACGGAGGCAATGATCAGATACAGATGGTCATAGAGTCTGGTATAGTTCCAAAGCTAATACCCCTGTTGTCACACAAGGAGGTTAAGACTCAGACGGCAGCTCTTAGAGCCGTCGGAAATATTGTGACTGGAACAGACGAACAAACACAAGTGGTACTCAACTGTGATGCACTTTCACATTTCCCGGCCTTACTTTCACACCCCAAAGAGAAGATCTGCAAGGAGGCGGTCTGGTTCCTGTCGAACATCACGGCCGGGAACAAGCAGCAGGTGCAGGCGGTGATCGACGCGGGTCTGCTGCCCAAGATCGTGGAGAACCTCAGCAAGGGAGAGTTCCAGACACAAAAGGAAGCGGCCTGGGCCGTCTCCAACCTCAGCATCTCGGGCACTAGCGAACAAGTGGCGGCCCTGGTACAGTGTGGAGTCATTCCACCCTTCTGTAACCTGCTGGACTGTAAGGACTCGCAAGTCATCAACGTGGTGTTGGACGGTCTCAGTAACATGCTGAAGATGGCCGGAGACAGCACGGAGGCGGTGGCCACCATGATAGAGGAGTGCGGCGGCATCGACAAGATAGAGGAGCTGCAAGGACACGAGAAGGTCGAGATATACAAGATGGCCTACGACATCATAGAACAGTACTTCGCTGACGAGGAGGAGGACGCCACGGTGGTGCCGCCGGCAGCCGACGCCACCTTCCAGTTCGAGACCGCCAAGCACGAACCCTTCCGCTTCTGA

Protein sequence:

>DPOGS210133-PA
MATDPMKHRMHVFKNTGKDVDEMRRRRNEVTVELRKNKREETLQKRRNVPVSYSTDEDDIDRTLASTDLEELVMKAANAENPEEQLAAVQQCRKLLSSDKNPPIDSLITTGILPILVQCLSRTDNPTLQFEAAWALTNIASGTSAQTNKVVHAGAVPLFLQLLMSPHENVCEQAVWALGNIIGDGPVLRDYVVELGVVKPLLSFIKPGIPITFLRNVTWVIVNLCRSKDPPPPVKTIQEILPALNELITHTDVNVLVDTVWAISYLTDGGNDQIQMVIESGIVPKLIPLLSHKEVKTQTAALRAVGNIVTGTDEQTQVVLNCDALSHFPALLSHPKEKICKEAVWFLSNITAGNKQQVQAVIDAGLLPKIVENLSKGEFQTQKEAAWAVSNLSISGTSEQVAALVQCGVIPPFCNLLDCKDSQVINVVLDGLSNMLKMAGDSTEAVATMIEECGGIDKIEELQGHEKVEIYKMAYDIIEQYFADEEEDATVVPPAADATFQFETAKHEPFRF-