Monarch geneset OGS2.0

DPOGS213613
TranscriptDPOGS213613-TA3588 bp
ProteinDPOGS213613-PA1195 aa
Genomic positionDPSCF300033 + 852500-864664
RNAseq coverage275x (Rank: top 39%)
Annotation
HeliconiusHMEL0136880.080.57% 
BombyxBGIBMGA011675-TA0.074.38% 
DrosophilaNup160-PA3e-5627.46% 
EBI UniRef50UniRef50_E1ZVT23e-7329.93%Nuclear pore complex protein Nup160-like protein n=5 Tax=Formicidae RepID=E1ZVT2_CAMFO
NCBI RefSeqXP_001845754.17e-7024.21%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3072129441e-8131.60%Nuclear pore complex protein Nup160-like protein [Harpegnathos saltator]
NCBI nr blastxgi|3072129447e-7631.51%Nuclear pore complex protein Nup160-like protein [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain[31-535] IPR0217172.6e-36Nucleoporin Nup120/160
Orthology groupMCL11943 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213613-TA
ATGGAGTTTGTCGATGCAATACCGGTAACGTTCAAAGAAATAACTCCAAATCATAATATTCCCGAAAAATGGAAAGAAGTTGTATTGAATACAGGAGGCACTCACAGTACATTACAAGATATAAAATTACCTCACAAAGCTGGCGGCTACTGGTACAAAGATTCGAAGAGAGCTAACACTAGGAACAGATTCATTTACTGGCAAACTACATCAGATTCTATTGAATTGTCAGAAGCTTCTTTAGATGTTAACTTGTCTGGGGCTAATTTGAGATGTAAAGTTGCAGCTGGGACTCCACCGTTAAATAATTTAGAGTTTTATGAGAAGACGTCCTCACAACAGTTAGTTATTTTAGCTGCTACTGTCTCTTCTGTACATAGATTAGTCTTTCCTCATCCTGATCTTCTGGATAGAAAGTCTACATTTGGGTCTATGTGCAGCATCTGCCCTTCCATATTTCATGACACTAGCTCACTCAACAATCCAAACAATTACTACATACTTAATCAATATTCTAACACAAATACAGGTGTGGCACATTCCTGTGCTTCCCTGCTGAGGGACAGCGGAGAGGCAGTTTTTGCTTTGGCATTTGGATATGGCGGTGAAGGAGGCCTTTTATTAGTAAAGTTGCCGCTGGCCGGCTCAGCCGTTACATTATCATTAAAACGAGAATCAACCGTGCCAAGATTCTTATCTGGCATCACGGGTGCTTTAAGGGGAAAAAGTGATGGAGTTGACACATATGGTGTGGTTCTGACAAACGGCTTGGTTGTTGCTATATGTGGAGATAACTGTCTCAGAGTGTGGTCGCTAGATGATGGTGGTGTACCAACAGCTGTTTCAACCTCTTTATCCCAGACATTGGTTAAACCAAAGCCTCCTCCTCACGGTCACATGCTGCAATATACAACAGGCTTAGATGGAAGTATAATATTGGTAGCTTATTTATCATTCCCAAATGAATGTGAATTTGTAGTTATGAAGATGCATGATGGGGGTGTGGGTGCTGCTAGGTTCACACAAATGTGTCACATCTTTGGACCACAGTTGGATCTTCTTGATTATACAATCGGTTACGGAGATGGTAACCACGTCATTTGGGCGTTATGGACTCAGCCCGATGGTGACGCTGTTGTTACAACTACTGGCATCGGTCCGGAGGCGCAGTGGCGCGCTGTCGCTGGTCGCGAGCTTCCGCCCTCTCTTCCTCAGTTGTCTTCACTCAACATGTACCGGGATCGGCTACTCGCGCCGGGACTCTTCCCTCCGGCCGTCATTCGTAAGGCATTGGTTATATACCGTCGTCAGTGGGGTGGTGAGGGGGGAGAGGGGGATGTTGATTTGGGGGAGGCGTGTGTGTCGGCGGTACAGGCAAGACTACGAACACTCACCGCACGACACCAGCCAGCCGACCATTCACACCTCATGCATAAATGCTGGACAGATTTATACAGCTGGTGCATGCAATACATGGAGGGGCTGCAGAAACCGCTCGGCCTCATGGTGTCTAAACATTATAAGGAGTCAGAGTCGGGCTGGTGGTGTGGCGTAGTGAGGCGGGCGGGCGTCTCCCTCGTTAGACAGTTGGAGCCCTACGAGCGAGTCATGTTGTCCCCGGAAGATACATTGCCTGATACTGTCTTCAGAGGTAGTGGGGAGGTGGGTCCGGTGTCGTCGGAGGCGATGCGCGTGGTGGTTGCGGGGGCGAGGTGGGAGCGTGGCGCCACTCCGGCGGGGGCCGCAGAGCTGGAGAGACGTCTCTTCGCGTGTGCTGCTCCACAGCACCGCCTGCTGCCGCGTCTGCTACATCTGCTGCTGCAACCCCCGGAGGACGCCGCCACGCCCACGCTGTCCCCGGAACAAATCGATGAAATCTCAGCCATCTTGGAACCGATCAACGATTTGCAGAATGCGGTACTCAATTTAAACGAAGCCCTGAGACTGGACGTTCCGGAGATAGATTCTAGTGACAATGATGATACCGCTGAGTACGATAATCTCTTTGCGAGCGATCTCGGAGTAGCTATAGTCACTGAAGCTATACGACAAATGGCTGAGATGAGATGTCGCGTTGTCCGCGGCGCCCTCGCAGCTCTCGGCGCGGCTCGCGGCGCCGGCGGGGTCCCGGGCGCTGGACACTGCGCCGTACACTGGCAGGCCTACCGAGCGCTGTTGTGGCTCAGGGCTGCCACTCACCAGTCACATGAATACTGGGCTGTGAGTGGTGGCTTCGAATTTGGTTGGTGGCTGGCGTCCATCAACCAGCCTCGCCTTGTACAGAGCTACGTTGCCCTTCTGGAGCCCTGGTGCGAGTGGAACGCCTGCTCACGCCAGTTCATTTTGGGCATGTCTCTATTGGAGCTGGGAGAGTCGGAGGCTGCGTACACGTCGTTCTGCCGCGCTGCCAAGGGTGTCAGCACGGAACCCTTCCTTAGATCGCTAGTGGCGCCCCACGACACCGCACTCACGCAACACCAGGCGCTAGTATTGTACTACATGAAGGTCATCAAGTTATTCGAGATCCATGATGCTGGAGCCTGCGTCGTGAGATTAGCGGAGACGGCTATCAGTATAGCTGATAAGGATGACCCTAATCTGGCAATGTTCCAGTGGGTTGTATTCAAGTGGCACCTGTCCGGTGGCCGAGTGTCCCGGGCCCTGAGTGCAGCCGCCGCAAACCCAGCAGCGAGCGCCCGAGCAGCCGCCGCCGCTGCCCTACTCACTACCCTCGCGGAGCGTAAACAGCTATCGGCGCTGGTGTCGTGTGGGTCGCTGGCTTTGGAAGCGGAGCGAGCGGCCGCGGCTCGCGCCAAGCTGCACGACCCCTACCCACACAACCCCTACTACGATTTCCTGTACGCGCTGCATCTTTCCAGGCATCACTATCGAAAAGCTGCGGCGGTAGTGTACGAGCGTGCGGCGCGGTGTGCGGGCGAGAGCGGCCCGGCGCGCCGGCGCTGGCTGGCGGCGGCGCTCACCTGTCTGAGGCTCGCGCAACCCAGACACGCCTTCCTCGCCAGACCTGACAGGACGCGGAACTCTAACGACGCTCTGCAGATTATTGGTCCCGAAGAACTTGCAGCCGAATTACGCGAGGAAGTTCCAGAATCCTTAGATCCGGTTCAGCAGGCTTTACTCAAAACGGATAACATAGATTTTGACTACTTATATCCAAATTTGAAAGAAGCTGATCCGGAAACTCTTCTAGCGGTGATGAAGAGGGCGATCAGTACCGGCCAGTTTATGCCACATTGGTTTCTTCAAAGGTTTTTGGAGCTGGAGCCTAATTCGTGTATTCGCGCCATGCTGAGCGGCGGGCGGGCTGTGGAAGCTGCTGAACTGTGTTGTGCTGCACTACGTCGCGACGCCCTGGCCCTTGTGCCGACCACCAACGCACCACCACGCGCCTCGCCACTGTCACTAGCCGACTTACTGCTACATGAACTGACTGAACATGATCACAATCCAAGAGTCAGAGAGGTTTACAATGACTTGCAGCTGATAGTTGAAGAGTATACCAAAATGATAGATCGCACATCGGAAGACCTAAAACTATCACAGTTAAAATACGGAATGACCAATTAG

Protein sequence:

>DPOGS213613-PA
MEFVDAIPVTFKEITPNHNIPEKWKEVVLNTGGTHSTLQDIKLPHKAGGYWYKDSKRANTRNRFIYWQTTSDSIELSEASLDVNLSGANLRCKVAAGTPPLNNLEFYEKTSSQQLVILAATVSSVHRLVFPHPDLLDRKSTFGSMCSICPSIFHDTSSLNNPNNYYILNQYSNTNTGVAHSCASLLRDSGEAVFALAFGYGGEGGLLLVKLPLAGSAVTLSLKRESTVPRFLSGITGALRGKSDGVDTYGVVLTNGLVVAICGDNCLRVWSLDDGGVPTAVSTSLSQTLVKPKPPPHGHMLQYTTGLDGSIILVAYLSFPNECEFVVMKMHDGGVGAARFTQMCHIFGPQLDLLDYTIGYGDGNHVIWALWTQPDGDAVVTTTGIGPEAQWRAVAGRELPPSLPQLSSLNMYRDRLLAPGLFPPAVIRKALVIYRRQWGGEGGEGDVDLGEACVSAVQARLRTLTARHQPADHSHLMHKCWTDLYSWCMQYMEGLQKPLGLMVSKHYKESESGWWCGVVRRAGVSLVRQLEPYERVMLSPEDTLPDTVFRGSGEVGPVSSEAMRVVVAGARWERGATPAGAAELERRLFACAAPQHRLLPRLLHLLLQPPEDAATPTLSPEQIDEISAILEPINDLQNAVLNLNEALRLDVPEIDSSDNDDTAEYDNLFASDLGVAIVTEAIRQMAEMRCRVVRGALAALGAARGAGGVPGAGHCAVHWQAYRALLWLRAATHQSHEYWAVSGGFEFGWWLASINQPRLVQSYVALLEPWCEWNACSRQFILGMSLLELGESEAAYTSFCRAAKGVSTEPFLRSLVAPHDTALTQHQALVLYYMKVIKLFEIHDAGACVVRLAETAISIADKDDPNLAMFQWVVFKWHLSGGRVSRALSAAAANPAASARAAAAAALLTTLAERKQLSALVSCGSLALEAERAAAARAKLHDPYPHNPYYDFLYALHLSRHHYRKAAAVVYERAARCAGESGPARRRWLAAALTCLRLAQPRHAFLARPDRTRNSNDALQIIGPEELAAELREEVPESLDPVQQALLKTDNIDFDYLYPNLKEADPETLLAVMKRAISTGQFMPHWFLQRFLELEPNSCIRAMLSGGRAVEAAELCCAALRRDALALVPTTNAPPRASPLSLADLLLHELTEHDHNPRVREVYNDLQLIVEEYTKMIDRTSEDLKLSQLKYGMTN-