Monarch geneset OGS2.0

DPOGS209018
TranscriptDPOGS209018-TA1101 bp
ProteinDPOGS209018-PA366 aa
Genomic positionDPSCF300209 + 108834-113694
RNAseq coverage438x (Rank: top 28%)
Annotation
HeliconiusHMEL0025462e-8152.88% 
BombyxBGIBMGA012556-TA2e-10765.79% 
DrosophilaKap-alpha1-PA3e-8251.16% 
EBI UniRef50UniRef50_O606841e-8455.19%Importin subunit alpha-7 n=362 Tax=root RepID=IMA7_HUMAN
NCBI RefSeqXP_001661138.11e-9055.96%importin alpha [Aedes aegypti]
NCBI nr blastpgi|2607905153e-9256.68%hypothetical protein BRAFLDRAFT_279360 [Branchiostoma floridae]
NCBI nr blastxgi|2607905152e-8956.68%hypothetical protein BRAFLDRAFT_279360 [Branchiostoma floridae]
Group
Gene OntologyGO:00054887.5e-73binding
GO:00068864.5e-17intracellular protein transport
GO:00056434.5e-17nuclear pore
GO:00056344.5e-17nucleus
GO:00085654.5e-17protein transporter activity
GO:00066064.5e-17protein import into nucleus
GO:00057374.5e-17cytoplasm
GO:00055151.1e-10protein binding
KEGG pathwayyli:YALI0E13992g6e-06 
 K08332 (VAC8)maps-> Regulation of autophagy
InterPro domain[14-338] IPR0160247.5e-73Armadillo-type fold
[141-335] IPR0119894.4e-43Armadillo-like helical
[15-60] IPR0026524.5e-17Importin-alpha-like, importin-beta-binding domain
[83-120] IPR0002251.1e-10Armadillo
Orthology groupMCL34786 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209018-TA
ATGGATTATTTAAAAGTCAACACATTGATGTCCGGAGGTCACAAGCATCGTTACAAGACAGCAGGATTAAGTGCTGATGAATTGCGACGTCGGCGTGAAGAGGAAGGGGTCCAACTTAGAAAACAAAAACGGCAACAACAACTATTTAAACGTCGAAATGTTAAAATGCCTAGCCCTTATGCCACTGATGCGCCTTTACAGTTTGAAGCTGCATGGGCCCTGACGAACATAGCATCTGGTTCATCGGAACAGACCTGTATAGTTGTGGAGTGTGGTGCTGTTCCAGTACTGGTACAGTTGGCCAGCGAGGGTCGTGATGAGGTTCGTGAGCAGGCTGTCTGGGCGCTCGGCAACGTTGCTGGGGATTCGCCCCGCTGTCGTGACGTCGTGCTGGCCGCGGGCTTACTGCCGCCTCTGCTCGAGCACAATAAGTACACAGTAGTGTCAGCGGCTCTGCGTGCGGTTGGCAACATTGTAACTGGCAACGACGCCCAGACCCAAGCCGTACTGAACTGTAACCCGCTGCCGAGTCTACGAGCACTGCTACGGTCCTCTACGGAGACGCTACGGAAGGAGGCCTGCTGGTCGCTGTCCAATATAACGGCTGGCAACGCAGCACAGATACAGACGGTAATAGACGCAGATATTATACCAATTTTGATAGAAATATTAAAATCAGCCGAATTTAAAACAAGAAAGGAAGCTGCTTGGGCGATAACCAATGCTACAAGTGGTGGGACGCACACACAGATATGTTATTTGGTCGAACAAGGTTGTATACCACCCCTATGTGATCTCCTGACTCTGACCGACACCAAGACTGTACAAGTGGCGCTCAACGGACTTGAAAATATATTAAAGGCTGGACAATCATATCGTAGGAATCCATTCGCAACACTTATTGAAGAATGCTTTGGTGTGGATAAGATCGAGTTCCTGCAGTCACATGAGAATTTGGAGATTTATCAGAAATCGTTTGAGATCATTGAGAATTACTTCGGTTCAGAGGGTGAAGACGTACGACTAGCGCCTGACACATCCGCCGACACCTTCACCTTCAATGCTGAGCATGCCGTACCCACTGGAGGGTATCAGTTCTGA

Protein sequence:

>DPOGS209018-PA
MDYLKVNTLMSGGHKHRYKTAGLSADELRRRREEEGVQLRKQKRQQQLFKRRNVKMPSPYATDAPLQFEAAWALTNIASGSSEQTCIVVECGAVPVLVQLASEGRDEVREQAVWALGNVAGDSPRCRDVVLAAGLLPPLLEHNKYTVVSAALRAVGNIVTGNDAQTQAVLNCNPLPSLRALLRSSTETLRKEACWSLSNITAGNAAQIQTVIDADIIPILIEILKSAEFKTRKEAAWAITNATSGGTHTQICYLVEQGCIPPLCDLLTLTDTKTVQVALNGLENILKAGQSYRRNPFATLIEECFGVDKIEFLQSHENLEIYQKSFEIIENYFGSEGEDVRLAPDTSADTFTFNAEHAVPTGGYQF-