Monarch geneset OGS2.0

DPOGS211901
TranscriptDPOGS211901-TA1707 bp
ProteinDPOGS211901-PA568 aa
Genomic positionDPSCF300011 - 165730-167599
RNAseq coverage382x (Rank: top 31%)
Annotation
HeliconiusHMEL0068445e-14753.02% 
BombyxBGIBMGA000995-TA3e-6570.24% 
DrosophilaNup50-PA7e-5733.66% 
EBI UniRef50UniRef50_Q7K0D81e-5433.66%CG2158 n=5 Tax=melanogaster subgroup RepID=Q7K0D8_DROME
NCBI RefSeqXP_001651179.18e-5643.83%nucleoporin [Aedes aegypti]
NCBI nr blastpgi|1948635843e-5432.08%GG23336 [Drosophila erecta]
NCBI nr blastxgi|665045807e-7034.54%PREDICTED: nuclear pore complex protein Nup50 [Apis mellifera]
Group
Gene OntologyGO:00056431.2e-20nuclear pore
GO:00055151.3e-14protein binding
GO:00469073.1e-09intracellular transport
KEGG pathway 
InterPro domain[2-160] IPR0150071.2e-20Nuclear pore complex, NUP2/50/61
[450-567] IPR0119931.3e-14Pleckstrin homology-type
[458-565] IPR0001563.1e-09Ran binding protein 1
Orthology groupMCL13234 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211901-TA
ATGTCTACGAAAAGGCAGGCTCCAACAGAATTAAATCATGAGAACTGGAATGACGAAGATCCAGTGGAAGAAGACCAGACTGGTGGTTTTGCTATAGCTCCTAAGGAGGTTCTTGAAAAGCGTGTGATAAGGACCGCTAAAAGACGCTCTCAAGCCATCAGCGGTGAGGCTAAGAAGAGTGTTTTTAGTGGCTTCAGTGGATTTAACAAAATTCAGCACTCATCATTTGATTTCTTATCAAATTTAACCAATGGCAGTAAGAATAATGGGAGTCTTACATCTACAAGTGAGACAGCTGTTTCATCGAGTTCCGTCAACAGCAACCCATCAAATACGATAAGCAGTGGAATATTCAGTTTATCAGGAACTTCAACTCCGACAAAGGCAGCTTTTGGCAAACCACCAGCAGATTCTCCATTTGGTGGTACATCAGAGACCACTGTTACTACTAGCTCTTTATTTTCGCAGTCTGTGTTTACGACATCCAAAGCAGATTCAACTGTTAGTGATAATCCATTTAAAATCCAGTCAACTGTGACCTCACCCATCGCAGGTGATTTTGGCAAAAAAACTAACCCTGCGTCAGATAAACCCTTATCTCAGGTCTCCTCGACTATATTTGGAACACCATCAACCAGTGCTAACAGCGGTAACACATTATTTTCATCTACATCTAACAACTTTGCAACATTCAAGAAGCAAACACCACAGATATTTAACAAGACAAGTGATGCTAAGAAACCTGAAAGTAAACCGGAAAGTAAAAGTAGTCTAATGACTTACCACAACAAGCTTAAGGGCCTTAATGAGTCGGTATCGGACTGTATCAGAAAACATGTAGAAGAGATGCCAATATGTATTTTAACTCCAATCTTCAAGGATTATGAAAAATATCTAAAAGATATTCAAGAAGAATATGAGAAAAATAAGGAACAGGAGAAGAAAGACAGTGCCAAGTCTGAACCTGAGCCAAAACAGAATACCAGCCCTGAGAAATCTGTGTCAGTTAGTAAGAGTACAACATCAACATTATTCTCAGGGAATAAATCATCCATATTTGCCGCTGTAACATCTTCACCGAAGACATCTAACACAACTTCAGTCTTCAATACTGGCACCAGCATCGCAAGCTGTACATTTTCGACTACAACTACTCCGAGTACTGGATTTTCCTTTGGTATTAAACCTAATACAAACACGTCTCCCTTAACGTCTGCTACAACAACCAATGGTAACACACCATTCTCTTTTGGGATGGGCAAACCTTTCAGCTTTAGCACAAATGTAAATACACCGAAACCTGAAGAACCAGCTAATGAAGCAAACGATAATGAAGATGTACCACCTAAAGTTGAATACACACCGATAGTAGAAGAGAACAGTGTTTATGATAAAAAATGTAAAATATTTGTCAAAAAAGACGGCAATTTCATTGATAAAGGTGTCGGTACTCTGTATATCAAGAAAGTAGAAGACAGCGGGAAGCATCAACTGCTGGTCCGAGCTAACACAGCTATCGGAAACGTCCTACTCAATTTAATTTTGTCTTCGGGCGTTCCGACCCAGAGAATGGGCAAGAACAATGTGATGATGATCTGCATCCCCACTCCGGACGCCAAACCTCCACCGACCTCTGTTCTGGTCCGAGTGAAGACATCAGAGGAAGCCGATGAATTGTTAGAAACTTTAAACAAATACAAAGTCTGA

Protein sequence:

>DPOGS211901-PA
MSTKRQAPTELNHENWNDEDPVEEDQTGGFAIAPKEVLEKRVIRTAKRRSQAISGEAKKSVFSGFSGFNKIQHSSFDFLSNLTNGSKNNGSLTSTSETAVSSSSVNSNPSNTISSGIFSLSGTSTPTKAAFGKPPADSPFGGTSETTVTTSSLFSQSVFTTSKADSTVSDNPFKIQSTVTSPIAGDFGKKTNPASDKPLSQVSSTIFGTPSTSANSGNTLFSSTSNNFATFKKQTPQIFNKTSDAKKPESKPESKSSLMTYHNKLKGLNESVSDCIRKHVEEMPICILTPIFKDYEKYLKDIQEEYEKNKEQEKKDSAKSEPEPKQNTSPEKSVSVSKSTTSTLFSGNKSSIFAAVTSSPKTSNTTSVFNTGTSIASCTFSTTTTPSTGFSFGIKPNTNTSPLTSATTTNGNTPFSFGMGKPFSFSTNVNTPKPEEPANEANDNEDVPPKVEYTPIVEENSVYDKKCKIFVKKDGNFIDKGVGTLYIKKVEDSGKHQLLVRANTAIGNVLLNLILSSGVPTQRMGKNNVMMICIPTPDAKPPPTSVLVRVKTSEEADELLETLNKYKV-