Monarch geneset OGS2.0

DPOGS214760
Transcript: DPOGS214760-TA, 3363 bp
Protein: DPOGS214760-PA, 1120 aa
Genomic position: DPSCF300022 + 1313620-1320642
RNAseq coverage: 566x (Rank: top 22%)
Annotation
Heliconius: HMEL011771; 2e-86; 51.22%
Bombyx: BGIBMGA004754-TA; 3e-97; 37.64%
Drosophila: Nup133-PA; 6e-98; 26.88%
EBI UniRef50: UniRef50_D6WR92; 8e-125; 28.11%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WR92_TRICA
NCBI RefSeq: XP_002431150.1; 4e-123; 29.13%; Nuclear pore complex protein Nup133, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|270009545; 3e-124; 28.11%; hypothetical protein TcasGA2_TC008819 [Tribolium castaneum]
NCBI nr blastx: gi|242021435; 2e-115; 29.65%; Nuclear pore complex protein Nup133, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0005515; 1.9e-63; protein binding
KEGG pathway:
InterPro domain: [44-430] IPR015943; 1.9e-63; WD40/YVTN repeat-like-containing domain
InterPro domain: [774-1070] IPR007187; 2.1e-19; Nucleoporin, Nup133/Nup155-like, C-terminal
Orthology group: MCL11882; Single-copy universal gene

Nucleotide sequence:

>DPOGS214760-TA
ATGGAGTTTAACAGCACAGGAGGAATGAGGAGTCCTTTTTCTCCTCGAGTACGACAATCTATCTCGGGGCGAAGACCTATTGGACTGGGTTCGGCAAAGAAAAATAGCAAGTTCATGCAACAGTCAGAGCAACAAACCGGCGAAATTGTATATAAAACTCCCTTCACGACTCTGGAAACATTCGGAACACCACTTCCAGTAATGGTTACAGAAACTCTTACCTTTCCATCTAGTGAGGTAAGTGTCCGTCTGTCCTCCTGTGGTTGGTGTTGGGCGGTCTGTGGGCGTAAAGTGCTAGCGTGGCCCTGGGACACCTCACTCCCTGCTGCCACCGCTCGTGATCTCACACTGCCACAGACAGACCTGGCACACAAAGCTGATTTAGTTGTTCTGTTCTACGAGAATGATGCACAGTTGCCATCCTGTATTGGTGTGTCTCCGGAGGGGGTGGTCCGCTACTGGTCCAGTGTAGGGGCGGAGGGCGCGTCGTGTGACGTGTCGTGTGAGCTCGCTGGCCAAGAGTGTGACAGGCTCATACAGGCCAGGGATGGACTGCTGCTGGCCACCACCACCTGCACCCTGGTCAGAATCACTACTACTAAGGAGGCTCGTCCGTCCGTGGTGTGTCACACTCTCCGTCCCCCGAGCGGCTGGCTGGGAGGTCTGGGCCGACGAGTGTCAGTACTGTTCTTCGGTTCCATGCCGGCTAACCATGACACGAAACTGGTGAAGGTGGTGTTGCTGAGCAGTCCTCGTGCGGACGAGCAGGCAGCGGACAAGGAGTGCGTGGCGCTGGTGGCCGGCGGACCGCTCGTGCAGTTGTGGGAGGATGGCGACGTCAGGGAGGTCTCGCTTCGGAGACCGCTCTGTGACGCGCTGGCCAGGACGCACCTCGCGCCCGCCGGTGAGCTGAGCGGGCTGGAGGTAGCGGCGCTGGACGCGGAGCCTCACCCCGGTGGCGGCCTGCTGCTGCTGCTGCTGCTGACCGTGGCCGCGCCTCGCGCTCCCGATGCTAGATATGCCCTCGCTCACGTATCCTTGGAGTCCGAGGAGCGTGTCCGCGTGTTGTCGGCGTGGTGTGTGAGGGGCGCACGCTCCGAGAGCCTCCCCCGCTGCCTGCCGCTGCAACCCCCGCTGGTCTACAACAGCGACGCAATCATCGGAGTCGCTCCGTCGGGCAAGGATCAGGCGGACGTGTTGGAAGTGTCTGCGGAGGGCGACTCCATCCTGGGAGCGTCCCAGGTGGGGGGCCGAGCCCTAGTGTTCACGAGACGACACGGGGTGCTGCTGCTACGGACCGCCGACCCCGCGGCTCAACATCATGCACCGTCGCTGTGCGACAGTCCTCTCGGCTCCCCGTGTCCCTCGGACGTGTTCGACGGGAACTTGACGCTGTACGAGATCGACCCGAACGAGGTGGTGGCCATAAGCGGTGACGCGGTGGGCAAGTTGAAGAGTGCGTTCCTGTACCACGTCCGCGGCCAGCAGGCCTCCGCCGCTGCCCTGTTGTCGGAGCTGGCCGGGAGACTCGACCCCTCCGCCACGGACCGGCCTCTGGATCGGACCGTGGTCACCGTTACCCGGGAAATGCTGGACGACGCGCCGGCCGGGGACCCCAGGTGGAAGCTCCCCAGTGGCGCGGCGACCCGCGTGTCCCTGGGCAGCTCGTGCTCGCTACAGGCGGCCGCCCAGCTCCACGACAAACAGAAGGTCTATAACATGTTCCTGGATTTCCTGAGGAGCCGCGGACTGTGGAGGCGCCTGGGGACCGTCACCGGGGACGCTCCCCCCGCAGAGAACGGCGAGGGCGTGTCCAGCACGCAGCACGAGGTGTGCGCGCTAGGAGAGCGGCTGGCGGCGGCCCGCGCGTTGCAGAGGCTGCACCAGGCGGGTGCTCCCCTTGTGGACGCGGCGCTGCACCAAGTGGCGGCTGGACTGGAGCGCGCGCCCGGCCATGAGGACGAGGCCGTGTTGGAGGCACTCCGAGGCGGCGCACTATC
CGCGGCCGACGTGTGCTGGCGGCGCGTGTCGCGAGTGCTGCGCGTGCTGACTGCGCTGTGTTCCCTGCCGCCGCCGCCGCACGACGCTCGCGCCGCCGCCTCGCACGCGCACCACGCGCTGGTCGCCGTCAACTCGGTGATGAGCGCGATGCAGGCGTACCGCTCTCAATGTGACGCCGCCCCGCCCCGCGCCGCTCCGTCCCTGGCGCCGCACGCCCTGCTCCCATCCCTGTGCTCGCTGCACACCCGCGCCGTCACTGAGTGTGCTCGCAAGTGTCCCGATGCGTCGCTCCGTTCTCAACTGCTGGAAGAGGCGTCATCGCTGGCTCGCTCCATCCTCCTGGAGGCGGAACCCCTGGCCGAGGGTCGCACGGCACATCTATACGAGAAGATGCGCTCCGACACCATACAGCCCTACCTCGCCGAGGGCCAGGCGGAGCGAGCGGCGGTGCTGGCGGAGAAGTTCAAGGACTTCGAGCTGTTGATACAGATGTGTGTGGACAAAAACGACCTGGAGAGGCTGGACGGGTACATGGACAAGTACGAGGACGAGGGATTCCCAGAGAAGACGTTCGCCTGGCTGGCGTCCCGCGGGGGTCGCATGTGTGCGTTGCTGGTGAGGTCGGTGGGGGCCCGCGTCCCTCGGCGCCTGGAGTCCTGGCTGGCCGCCGCGCCCGACCGCCTCACCCTCAGGACCGTCCACGCTCTGGCACGGGGAGAGCTCGACCTCGCCACGGAGCTGTTCGCCCAGCTCGCGGATAACGAAAATGTTTACGTCAACAGAATGGCGGATTTAACACGGTCGTTGTTTCAGACGGCGGCCTCGTTGTCCAAGCTGTGCTCGCTGGCCGGCGGGTCGCAGGAGGCCGCCAGCCGCGTCTGTCGCGCCTTCAGCGTCGTCCGCCAACACCGCGCCCTGCCCGCCGCCCTCACCCGGAGACACGCGCTCGACCAACACGAACCCAAACTGTTCACTCCCGAGGAGCTCATACAGATGTACATTGAGTCCGAGAGTCGCTCGCTGACGGAATACGACTACAAGAAGGCGCTCGACCTGACGGAGCTGGTCACAGACCTGGAGCGGAGAGACGACCTGCGGCTGAAGATCTGGTGCGCGTGTATCCGTCAAGACGACTGGTCGCGCAGCCGCGTGGACGCGCCGGAGCTGGAGATGAAGGACAAGATGTTCTTTAGACTCCTCGACCTCGTGCATCTCATGGGTGGAGAGCTGGACGCGGTGCTGCCGCCGCTGGAGGCCGTGGTGTCGGCCCCCGAGCTGGAGTCGCTGGCGGCCGACCCGCGCGCCCTCTACCTCCTCAAGTACGGCTACCAGTGTGTGGCGACACAACACGACGACCAATAG

Protein sequence:

>DPOGS214760-PA
MEFNSTGGMRSPFSPRVRQSISGRRPIGLGSAKKNSKFMQQSEQQTGEIVYKTPFTTLETFGTPLPVMVTETLTFPSSEVSVRLSSCGWCWAVCGRKVLAWPWDTSLPAATARDLTLPQTDLAHKADLVVLFYENDAQLPSCIGVSPEGVVRYWSSVGAEGASCDVSCELAGQECDRLIQARDGLLLATTTCTLVRITTTKEARPSVVCHTLRPPSGWLGGLGRRVSVLFFGSMPANHDTKLVKVVLLSSPRADEQAADKECVALVAGGPLVQLWEDGDVREVSLRRPLCDALARTHLAPAGELSGLEVAALDAEPHPGGGLLLLLLLTVAAPRAPDARYALAHVSLESEERVRVLSAWCVRGARSESLPRCLPLQPPLVYNSDAIIGVAPSGKDQADVLEVSAEGDSILGASQVGGRALVFTRRHGVLLLRTADPAAQHHAPSLCDSPLGSPCPSDVFDGNLTLYEIDPNEVVAISGDAVGKLKSAFLYHVRGQQASAAALLSELAGRLDPSATDRPLDRTVVTVTREMLDDAPAGDPRWKLPSGAATRVSLGSSCSLQAAAQLHDKQKVYNMFLDFLRSRGLWRRLGTVTGDAPPAENGEGVSSTQHEVCALGERLAAARALQRLHQAGAPLVDAALHQVAAGLERAPGHEDEAVLEALRGGALSAADVCWRRVSRVLRVLTALCSLPPPPHDARAAASHAHHALVAVNSVMSAMQAYRSQCDAAPPRAAPSLAPHALLPSLCSLHTRAVTECARKCPDASLRSQLLEEASSLARSILLEAEPLAEGRTAHLYEKMRSDTIQPYLAEGQAERAAVLAEKFKDFELLIQMCVDKNDLERLDGYMDKYEDEGFPEKTFAWLASRGGRMCALLVRSVGARVPRRLESWLAAAPDRLTLRTVHALARGELDLATELFAQLADNENVYVNRMADLTRSLFQTAASLSKLCSLAGGSQEAASRVCRAFSVVRQHRALPAALTRRHALDQHEPKLFTPEELIQMYIESESRSLTEYDYKKALDLTELVTDLERRDDLRLKIWCACIRQDDWSRSRVDAPELEMKDKMFFRLLDLVHLMGGELDAVLPPLEAVVSAPELESLAADPRALYLLKYGYQCVATQHDDQ-
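
The lengths reported in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1), that the transcript sequence is a coding sequence ending in a stop codon, and that the genomic coordinates are 1-based and inclusive. The helper names (`translate`, `expected_cds_length`, `span_length`) are hypothetical and chosen only for illustration. The code writes a stop codon as `*`, whereas the protein sequence above ends in `-`.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues (plus a stop codon)."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
print(expected_cds_length(1120))        # compare with the 3363 bp transcript
print(span_length(1313620, 1320642))    # genomic span on DPSCF300022
print(translate("ATGGAGTTTAACAGCACAGGAGGAATGAGG"))  # first 10 codons of the transcript
print(translate("CACGACGACCAATAG"))     # last 5 codons of the transcript
```

Under these assumptions, 1120 residues plus one stop codon account for exactly 3363 bp, and the first and last codons of DPOGS214760-TA translate to the start (`MEFNSTGGMR`) and end (`HDDQ` followed by a stop) of DPOGS214760-PA. The genomic interval is longer than the transcript, which is consistent with the presence of introns.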