Monarch geneset OGS2.0

DPOGS200024
TranscriptDPOGS200024-TA3660 bp
ProteinDPOGS200024-PA1219 aa
Genomic positionDPSCF300337 - 118551-137116
RNAseq coverage688x (Rank: top 19%)
Annotation
HeliconiusHMEL0036810.071.61% 
BombyxBGIBMGA012444-TA0.079.38% 
DrosophilaRanbp11-PA3e-15034.53% 
EBI UniRef50UniRef50_E9ID790.043.33%Putative uncharacterized protein (Fragment) n=7 Tax=Solenopsis invicta RepID=E9ID79_SOLIN
NCBI RefSeqXP_396286.30.046.31%PREDICTED: similar to importin 11 [Apis mellifera]
NCBI nr blastpgi|2700013550.047.17%hypothetical protein TcasGA2_TC000164 [Tribolium castaneum]
NCBI nr blastxgi|2700013550.046.76%hypothetical protein TcasGA2_TC000164 [Tribolium castaneum]
Group
Gene OntologyGO:00054881.8e-73binding
GO:00068862.1e-16intracellular protein transport
GO:00085652.1e-16protein transporter activity
KEGG pathway 
InterPro domain[228-1131] IPR0160241.8e-73Armadillo-type fold
[227-1125] IPR0119898.1e-57Armadillo-like helical
[254-322] IPR0014942.1e-16Importin-beta, N-terminal
[115-225] IPR0110221.3e-09Arrestin-like, C-terminal
Orthology groupMCL13360 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200024-TA
ATGCCTTACATTAGAGTATACACTGGGAAAGAAAAAGTTTTTGAGCTTCAAATCGATATTTTTAAGGAATTAAAAGGTCATAAAGTAAAACCGGGTTCAATCAGCTCTCAATTTCATTTTATTCTACCACCGGATCTTCCATCAACCTATGAAGATTCTGTCGCTAAAATATATTACCACATTATATTAAAGGAAAAGGCACAACAGATAATTAAAATTCCTATAACGGTAATAAGTTACGTGAACTTAAACCATCTTCAAGAATACAAGGTGCCAACCATTTATGACATGACAAAGGTCTGTCGTTTCTCTGAGGAGGTAGTGGTTTTTTTGAGAACATATAAATCTTTTGCTCCTCAGCAATTCGTACCTTTCGAAGCGATAATATCTAATGAAAAGAATGTAACCATAACTAAAATTGTAGTTAAACTTATTCAAAAATTGGAGTATAACGTCTCCTCAGGGTATTATAATGCAGAAAAAACTGTTTGTAAGAGCGAGCACCTTGATATTAAAAATGTTGCCAGACAAACATGCACTTTTGGTATGAATGTTCCTAAAGTAATTCCTTCAGCACTGAATGACTACAATCATATGGTAAAAAGTTCATATTTGTTACGGGTTAAAATAAGTTTTCGGTTTCATTTTCCGTTATATTCTGATATACCTGTTGTTGATAAAATGGATCCAAACATTTATGCTCTAGTGCTTGACACACTAAACAGAGCGACCAGCCAAGACGCAGAAGTCCTGAAGCCAGCTGAGAAGAAACTCCAAGAATGGGAAGTAGAGCCCGGCTTTTATTCAGTGCTATTGAATGTCCTGTCGAATCATTCAATCGACATCAACGTGAGGTGGCTGGCTGTGATGTGTTTTAAGAACGGTGTGGATCGCTATTGGAGACGAAACGCACCCAATGCTATCACGGATGAAGAGAAGCAGAAGCTCAGACAGGGTCTGCTGACCTCTCCAATACTCAGTGAACCAATCGCGCAGATTGCTACACAACAGGCTGTTCTTATATCTAAAATAGCAAGATTTGACTGTCCAACAAACTGGCCAACACTCCTACCGGATCTAACGAGTGCCATGAAAGCTCCACAATCGCTGCTACAACACAGATCGCTGCTTATCTTCCACCATGTGGTGAAGGCATTGGCTTCCAAACGACTCATAGAAGACAAACGGACATTTCAGGAACTAACAAACTCAGTATACGCTTTCATCCTGAACCTGTGGCATGAGAACACAGAATGTTTTCTGAGACATATACAGGAGGGCGCCTCTACTGAGCTGATAACAGAACACCTCGAAAAGGCCTTGCTGTGTCTGAGAATATTGAGGAAGCTGACGGTGTTCGGCTTCAAGAAGCCACACGAGAGTCAAGACGCTGTGGCATTCCTTAATGTTGTGTTCGACAGAGCTAAGACGAGCCTGGAGTGCAGGAAGTTGTTGAAAGGTCGGGGCATATATCCATTGGAGCTATGTGAGAAATTTATCATTCACCTCACCAAAGTGGCGCTGGGGGTGCTGTCCTGTCACCCGTTCTCATACATGCCTCTGATCAGACCGTCGCTGGAATTCACTCTATACTATTGTTTCACTGAAGTCGGAATGTCTTTGACATACGAAAGGTTCACTATACAGTGTTTGAATATACTGAAAGGCATCCTGCAGTGCGTCGAGTACAAACTACCTAAAGGGAACGAACCGGTCAAAGAGCCAATCCGAGCTCAAGCCCACCAGCTCAAGTGGCAGGTGTTGGATCAGCGGACAGTGTGCCACATGTGCCGACACCTAGTCACACACTATTTCCTACTCACGGCTGACGACCTCGCGCTTTGGGACGCGGAGCCGGAGAGCTTCGCAACCGACGAGGCCGGCGAATCGTGGAAGTACAGCCTTAAGCCGTGCACCGAGGCGGTGTTCCTGGAACTGTTCCACGAGTACCGGTCGGTGCTGGCGCCCGAGCTGGTCAGAATGTTGGCGTCCCTGCAAGAAACACAAGTCTCCCCGGACGACCTGCCCGCCATATTGAAAAAGGACGCCATTTATAACGCTGTCGGACTGGCAGCTTTCGACTTATACGATGATGTCGACTTTGACGAGTGGTTCACGAACGTGCTGAGCAAGGAATTGAAGATAGAAGACAACAATTACAGAATAATACGGAGGAGAGTTTGTCAGTTAATAAGTCATTGGTGTGGAGTGCGAGCGTCTCAGTCGCTCCGGCCGGCGATGTACACGGCTCTGCTGGAACCGCTCACTCGCAAGGGCGAGGATCCTGCTGTGAAGCTTGCAGCCGCTGAAGCGCTGCGCAGCACTATAGACGACTTCAACTTTGACGTTGAACAGTTCGCGCCGTTTGCGCCGCACGCGCTCTCTGCGCTTTATGATCTATTGGTGGAGTGTACGGAATGCGAGACTAAGATGCACGTACTTCACGTTGTGTCGTACGTGACGGAGCGGTGCGGGTGGGTGGTGGCGCGGGGCGGGGGCGGAGGCGCCGGGGCTCTGTTCGCCGTCCTACCCGCGCTGTGCCTTCACGCGGCGCACCATCACATGCTGCGAGCCGCCGCCCTCGCTGCACTGGTACACCTCGTTAAGGCGTTGGGCGAGTGTGATCCGAGCGTCCGTCCCTGGGTGTTGAACGTCATCAACGAGTCAACGAAGCTATCTGAGCCGGCTCACGTGTACCTGCTGGAAGATGCCCTGGAGCTTTGGCTGGCGGTGCTGGAGACCTCACCAGCTGCCGACCAACCTCTGCTTCAGCTGGCTCACAATCTTTACCCCATACTCGAACAATCTACGGAGCACTTACGAATAGTTGTCTATATAATGCAAGCGTACGCTCTCCTGTGTCCTGAAGAATTCTTCACGGGGGCCGGCGCTAAATGTATGTCGCTGTTAGACGATATGCTGAGCGATATGAGGACTGAGGGGGTCATCACTGTACTGAAGATGGCTGAAATATGCATCAGTGTGACCTTCCCCGAGAGGAACGCCTTTCTGGGGGTCAAAGTTATCTGGCCGACTCTAATACGGGTGATACACTGTCTTTTAGAAGGCGATGATCTTCCTATGGTGTTATCGGTGCAGCTGTGTCTGTTATCGCGAGTGATTCTCATCTCGTCGGATTTGTTCTACAAAGTGGTCCAGGAGGCTGCCGCCGGACTGGTCGAGTACTCGTCGGACCCGGTGAAGGTGTTGACTAGACTGTTACACGTCTGGACGGAGAAGATGAATCTAGTTACGCAGGTGGAGAGGTGGAAGGTTGTTGGTCTAGGTCTAGCCGCTCTACTGACGACTCACAATGCTGCTGTTCTGGAACGTTTTCCAGCAGTATTGTTGAATCTCACAGAAGTTCTCAATGACGTCATGAAGATGGAAGAATCCGGGAATTACGTTGACGCTTTACTTTCACCTCCCGGCTCGCGGCCCGCGTCTCCTCTCGAGGGCCGTGGCGGCGCCGCTCGCTGGGAGGCGGCCGACGACTGCGCCGCGCCAGCGGAACAACAGCTCACAGCACACGAGATACGGAGGAGGAGGTTCGCGGCGGCCCATCCCGCGAGGGCGCTGGACCTGCGAGCTGTCACACACCACCAGGTGACTGGATATGTGACTGGGAGGGAGAAGGATGACTCGATGGGCCGCGCGTAA

Protein sequence:

>DPOGS200024-PA
MPYIRVYTGKEKVFELQIDIFKELKGHKVKPGSISSQFHFILPPDLPSTYEDSVAKIYYHIILKEKAQQIIKIPITVISYVNLNHLQEYKVPTIYDMTKVCRFSEEVVVFLRTYKSFAPQQFVPFEAIISNEKNVTITKIVVKLIQKLEYNVSSGYYNAEKTVCKSEHLDIKNVARQTCTFGMNVPKVIPSALNDYNHMVKSSYLLRVKISFRFHFPLYSDIPVVDKMDPNIYALVLDTLNRATSQDAEVLKPAEKKLQEWEVEPGFYSVLLNVLSNHSIDINVRWLAVMCFKNGVDRYWRRNAPNAITDEEKQKLRQGLLTSPILSEPIAQIATQQAVLISKIARFDCPTNWPTLLPDLTSAMKAPQSLLQHRSLLIFHHVVKALASKRLIEDKRTFQELTNSVYAFILNLWHENTECFLRHIQEGASTELITEHLEKALLCLRILRKLTVFGFKKPHESQDAVAFLNVVFDRAKTSLECRKLLKGRGIYPLELCEKFIIHLTKVALGVLSCHPFSYMPLIRPSLEFTLYYCFTEVGMSLTYERFTIQCLNILKGILQCVEYKLPKGNEPVKEPIRAQAHQLKWQVLDQRTVCHMCRHLVTHYFLLTADDLALWDAEPESFATDEAGESWKYSLKPCTEAVFLELFHEYRSVLAPELVRMLASLQETQVSPDDLPAILKKDAIYNAVGLAAFDLYDDVDFDEWFTNVLSKELKIEDNNYRIIRRRVCQLISHWCGVRASQSLRPAMYTALLEPLTRKGEDPAVKLAAAEALRSTIDDFNFDVEQFAPFAPHALSALYDLLVECTECETKMHVLHVVSYVTERCGWVVARGGGGGAGALFAVLPALCLHAAHHHMLRAAALAALVHLVKALGECDPSVRPWVLNVINESTKLSEPAHVYLLEDALELWLAVLETSPAADQPLLQLAHNLYPILEQSTEHLRIVVYIMQAYALLCPEEFFTGAGAKCMSLLDDMLSDMRTEGVITVLKMAEICISVTFPERNAFLGVKVIWPTLIRVIHCLLEGDDLPMVLSVQLCLLSRVILISSDLFYKVVQEAAAGLVEYSSDPVKVLTRLLHVWTEKMNLVTQVERWKVVGLGLAALLTTHNAAVLERFPAVLLNLTEVLNDVMKMEESGNYVDALLSPPGSRPASPLEGRGGAARWEAADDCAAPAEQQLTAHEIRRRRFAAAHPARALDLRAVTHHQVTGYVTGREKDDSMGRA-