Monarch geneset OGS2.0

DPOGS214418
Transcript: DPOGS214418-TA, 1524 bp
Protein: DPOGS214418-PA, 507 aa
Genomic position: DPSCF300069 + 264989-271465
RNAseq coverage: 1105x (Rank: top 11%)
Annotation
Heliconius: HMEL020783  3e-101  55.37%
Bombyx: BGIBMGA011351-TA  0.0  69.29%
Drosophila: Pen-PA  4e-108  43.44%
EBI UniRef50: UniRef50_E0W119  9e-123  45.94%  Importin alpha-2 subunit, putative n=2 Tax=Neoptera RepID=E0W119_PEDHC
NCBI RefSeq: XP_975293.1  4e-131  48.15%  PREDICTED: similar to importin alpha 1a [Tribolium castaneum]
NCBI nr blastp: gi|91077394  8e-130  48.15%  PREDICTED: similar to importin alpha 1a [Tribolium castaneum]
NCBI nr blastx: gi|91077394  1e-125  48.15%  PREDICTED: similar to importin alpha 1a [Tribolium castaneum]
Group
Gene Ontology:
  GO:0005488  2.1e-98  binding
  GO:0006886  2.9e-14  intracellular protein transport
  GO:0005643  2.9e-14  nuclear pore
  GO:0005634  2.9e-14  nucleus
  GO:0008565  2.9e-14  protein transporter activity
  GO:0006606  2.9e-14  protein import into nucleus
  GO:0005737  2.9e-14  cytoplasm
  GO:0005515  2.2e-10  protein binding
KEGG pathway:
  ddi:DDB_G0287221  8e-09
  K10590 (TRIP12) maps-> Ubiquitin mediated proteolysis
InterPro domain:
  [19-484]  IPR016024  2.1e-98  Armadillo-type fold
  [20-481]  IPR011989  3.9e-96  Armadillo-like helical
  [5-94]  IPR002652  2.9e-14  Importin-alpha-like, importin-beta-binding domain
  [350-388]  IPR000225  2.2e-10  Armadillo
Orthology group: MCL25839  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214418-TA
ATGTCCGACGGAGTTAATAGATCACGTTTAGCGTCCTACAAAAATGCCGGCCAAGGAGTTACAGAGCTTCGTCGTAAGAGAGCTGAATTGAGCGTCTCCCTACGTAAACAGGCTCGTGATGAACAGCTTCTTAAGAGGCGTGCCATGTCTCCTGAAGCGGGAGAGGAGCCAGAAACAGAAAAGGTCATGACACCAAGTGAAATAGTCCAGGGTTTGAAGTCATCCGATCTATCGGTGAAGACTGCATCGGCTAGAGCTGCTCGTCGGATGTTGTCCAGAGAACAGAACCCTCCTATTACGGTGATGGTTAATGCTGGTGTTATACAGCCCTTGGTCAGCTCGTTGGATAGAGATGATTGTCCTGATCTTCAGTTTGAGGCGGCTTGGGCCATCACTAACATTGCGTCTGGGACCCATGATCATACGTTGGCTGTTATAGAAGGTGGTGCAATCCCGAAATTGGTGAATCTTCTATCAGCTGGTGGTGCTGTGGGGGAACAGAGCGCGTGGGCCCTTGGTAACATCGCTGGCGACGGGCCCATACCCCGCGACTCAGTACTGGCAGCCTCAGCATTACCCGCCTTGCTACCTCACCTCTCGATTGGAGCGCCGCCCGCCACTCTCAGAACAGCCGTTTGGACCTATAGCAATCTCTGCAGGAACAAAAATCCACTCGTCCGCTTCGAGTTAGTATCTCCCGCGCTGCCTGCTATATCTGAACTACTAACGGTCGCTGATCAGGACGTGTTGGCGGATGCATGCTGGGCGCTATCTTATTTAACCGACGGGCCTGATGAGAGGATTGAGGCGGTGCAGAGCACTCCGCACCTCCTTGGAAGGCTCGTCTCCTTGCTATCTCACCGAGCGGCATCAGTTAGGACACCAGCGCTGAGAGCTGTGGGTAACATGCTGACCGGCTCAGACAGACAGACGGATAGGGTCTTGGAGGCTGGATGTCTCGATCCATTGGCGACGCTCCTGAAATGCGGCAGATCCTCGCTGGTGAAGGAGGCCGCTTGGGCGTTGTCCAACGTGTTCGCTGGTACGAGCCAGCAGATACAGATAGCTATCGATTCTGGTGTCTTACCAGTCTTAGTGTCTGTTTTATCTGCGGATGATGCTAAGTGTCAGAAGGAGGCTGCGTGGGCTATCAGCAATGTGTGTCTGGGTGGAACACCAGCTCAGTTAGACAGGCTGATAGGGGCCGGGTTTCTGGAGCCCTACTGTGCGCTGTTGGAGGCGCCCGATCAGCGCACGGTCGCTGTAGTATTGGATGGTCTCGCTCATTTGCTCCAGGCGGCAGCGAAATTCGGTCAAGTGGATCCTCTCTGTGTGAAGCTAGAGGAGGTTGGAGCCCTGGACCGCATCGAGGGACTCCAACAACACGAGAATGAACAGATCTACCGCAAAGCCTTGCATATCCTCGACACATACTTCGTGGACCAGGAAGATCAGTCCCAGCCGGAGCAGATTGGTGATGAGTACCACTTCAGTGCCCACGACGACCAACAAATACAATTCTAA

Protein sequence:

>DPOGS214418-PA
MSDGVNRSRLASYKNAGQGVTELRRKRAELSVSLRKQARDEQLLKRRAMSPEAGEEPETEKVMTPSEIVQGLKSSDLSVKTASARAARRMLSREQNPPITVMVNAGVIQPLVSSLDRDDCPDLQFEAAWAITNIASGTHDHTLAVIEGGAIPKLVNLLSAGGAVGEQSAWALGNIAGDGPIPRDSVLAASALPALLPHLSIGAPPATLRTAVWTYSNLCRNKNPLVRFELVSPALPAISELLTVADQDVLADACWALSYLTDGPDERIEAVQSTPHLLGRLVSLLSHRAASVRTPALRAVGNMLTGSDRQTDRVLEAGCLDPLATLLKCGRSSLVKEAAWALSNVFAGTSQQIQIAIDSGVLPVLVSVLSADDAKCQKEAAWAISNVCLGGTPAQLDRLIGAGFLEPYCALLEAPDQRTVAVVLDGLAHLLQAAAKFGQVDPLCVKLEEVGALDRIEGLQQHENEQIYRKALHILDTYFVDQEDQSQPEQIGDEYHFSAHDDQQIQF-
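
Length check:

The transcript and protein lengths reported in this record are consistent with each other: 1524 bp spans 508 codons, and leaving out the terminal stop codon gives 507 residues. The sketch below reproduces that arithmetic. It assumes the transcript is a complete coding sequence that ends in a stop codon; the function name expected_protein_length is hypothetical and is not part of the database.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a complete CDS of cds_bp bases, excluding the stop codon.

    Assumption: the sequence is a full open reading frame, so its length
    is a multiple of 3 and the last codon is a stop codon.
    """
    if cds_bp < 3 or cds_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    # Total codons minus the one stop codon.
    return cds_bp // 3 - 1


if __name__ == "__main__":
    # Lengths taken from this record: transcript 1524 bp, protein 507 aa.
    print(expected_protein_length(1524))
```

A mismatch between this value and the listed protein length would suggest untranslated regions in the transcript or a partial gene model.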