Monarch geneset OGS2.0

DPOGS215465
TranscriptDPOGS215465-TA4917 bp
ProteinDPOGS215465-PA1638 aa
Genomic positionDPSCF300098 - 407498-423802
RNAseq coverage381x (Rank: top 31%)
Annotation
HeliconiusHMEL0034278e-13149.16% 
BombyxBGIBMGA007325-TA0.057.49% 
Drosophilagp210-PA1e-5630.22% 
EBI UniRef50UniRef50_E2BC593e-7424.53%Nuclear pore membrane glycoprotein 210 n=8 Tax=Formicidae RepID=E2BC59_HARSA
NCBI RefSeqXP_001601346.11e-8026.32%PREDICTED: similar to ENSANGP00000004199 [Nasonia vitripennis]
NCBI nr blastpgi|3454885268e-8026.27%PREDICTED: nuclear pore membrane glycoprotein 210-like [Nasonia vitripennis]
NCBI nr blastxgi|3454885265e-8225.07%PREDICTED: nuclear pore membrane glycoprotein 210-like [Nasonia vitripennis]
Group
KEGG pathwaycfa:4846328e-26 
 K12495 (IQSEC)maps-> Endocytosis
Orthology groupMCL11310 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215465-TA
ATGAAGAGCCTACGGACTAGTATTGTCTCCGTGTTGACATTACTTTCATTTATTACTAATTGTGAATCTGCCAAAATAAATACACCGAGAGTTTTGCTTCCATGGTTTGAAAATTTATATGTAAACTTTACTTTTGAAATCATAGAAGGTGGCTGCTATTCCTGGAGTTTGTCTCGGGATGACATCATTGATCTTGTGCCATTGTACGAAGATACATGGGGTCACTGTTCACGAGCTGCACGTGTTTCAGTTTCCAAAACATGTATACCGCCGGGTTCAGTAATTATTCTAGCAGAAGAAGTTAATACAGGAGAGATCCTAAGAGGTGATGTGGATGTTGATATAATAAGATCATTAAAAATTATGAGTACAACTCGTAATTTATATTTGGAAGAAGCCCCCGAAGCTTTTGAAGTGGTGGCATACGATGACAAAGGAAACAAGTTTTCCTCTCTGGAAGGAATAAGTTTCACTTGGAATACTGAAAATGTGGATAATAGTGGAAATCACCCATTGATTACTCTCGTCCAATGGAAAGACACTGATTATGAAGCCCCTCAAGGTATAGCTGAATTAGAAGCTCAAGGCCTACAGTCATATTCTGTGCTTCTTTACGGTCAGGCTATGGGGGAATCCCATGTTACAGTGTGTCTTGATAAAATTTGTACCGACTTTTATCTTCATGTGGTTGCCAGTGTTGTTCTGACTCCGGCAGTGGCTTATGTTGCCCCCGGTGATACTTTAAGATATAAGGTTGTAAGAGCGCGTGCTGGTCGTCTGACTGTACAAGATGTGGCCGCCACAATATACAGAATGGAATTGCCACAATCTGATGTTGCCACACTAGAAGACGGCGTAAGCCTTGTCAGAGCTGCAGAAGTAGGGACATCTCATGTTTACCTTAAATCAGAGGCAACAGAAGTGGCAATGGCAACACTCACGGTTGTAGAACCATACTCTATACGCGTCACTTTAAGACCATCTAATATGATAATACGCGGTGAACAATTTACTGTACACTGTGTTGTATTCGATGAGGATGGGCACCCATTAACAGCTGGGCCGGAAATATCAATAAGGCTTACAGTGAACGGTGAAGCAGATGTCCTTTTGATGAAATCAACAGAAAACGGAACAATAACTGATGCTGTGGCCTACAACACTGGAGAGTTTACAGTTACAGCTAAATTAGTTTCGGTAGCTGGTAAAAGTATGTTTAAAAATGTTGAAGGTCAAATAACTGCAAAGGCTGTAGACCCGTTAGCGATGGTGCCACCAGAAATGTTTATAGCATGGACTGAATCAAACTTGGAGTTTCCATTAAAACATAGCGGCGGTGGTGATGAACATGTGACTTGGTCAGAGAGACAAGATAGCAATCTCGTGCTGACGCCAGCTGGTTTGCTAACTGTACGTGGCGTTGGACACATGGATGTCAGATTACATCTCACAAAGTATCCACATATCCAGGCGTCGGGGAGGGTGTGGTCAGCTGTTCCCGAACTCCTGCAAGTGTCGACCTCAGGAACAGCGCGCGTGGCTCGTCCTCATCATCTACACATCAAGCTCACTGGAACTCATCCGGCTACCGGGGAATTATACAACTTTAATACCTGCAACTGCGCGTCATTTGCTGTGTCCTTAGTCGAGGGTCCGGAGCCGCAGAATGTGACGGCTGCTCCTTGGGTAAAACCAAAAGATGGCGCGTGTTGTGTAGTGGAGTGCACGTTTTTGTTCCGCGGCGTGTCGACGTTGCTTGTGTCCCGTGGTAGGGCTGCCGACACGGCCCGTGTGGCGGTCAGGGCTGCCCCAAGCCTATTGTGGCCGCACGCCGCCGCTCTGTTGCCTGGCGCTACGATACCTGTGATCGGCGAGGGTGAATCACTAATTCCTCAGTCGAGTCAGCCGAGGGTCGCTGAACTCATCGGTCGTGATGGCGCTCCGCCCCACAATTATCCTGATGCCCAACTGTTCACCCTTAAATGCCAAAGGAAAGGTGAAGCGTTGGTGGAATTATCATCGTTGCAAGAGGAACGCGAGTCAGTGTCTCTTGAGACGTGGTGCTCGTCGCACGTGACGCGAGTTCGACTCGATCCTCCGGACACGCAAGGAAATTGTTCAGGACCGAGGCTGTGGTTGCGTCCCGGGCAAGAAGTGCCCATAAAGGTGACTCTGTTTGACACAATAGGAAGACAGCTATTGGACGAAGATGGACCGATTATGCAATGGGATGTACAACCTTATCATATAGGGATAGGTGTCAAAACCACAGACCGATTGTTTGTAGAGACAAACAGTAAATATGCTCCGGTACCAGTACCTGATAAATATTATCAACTAGTAGTGGCAACGGAACAGGCTATAGGATGGAGCGGTTCTATAAAAGCCGCAATTCCTGATACTTCAGCGACGATACAAGCCAAAGTTGTTTCTCCCTTGAAATGCGACCCTATGAAGGTTCACATCGCATGGGAAGGGGAAACCGTGCCTAATATATCAGCCGTCACCGGCGGCAGCGGAAAATATATAGTCGAAACCCCGAAAGGAGTTACAGCATCCGTTGATGAAGGCAGATTATCAGCAGTGTTGCCAACGCCGGGTACTTATGATCTGATTGTGGCAGATTCATGTGTTAGCGGCGAGAAGAACATTGTCGAGGTCATTATAGAAGAGGTTCTGAGTGTGGAGGTATCAACCGCGAGGGCTGTGTGTGTCGACAATTGTATTCCCATCAGAGCGCTGGTGAAAGGCGTCTCCCATAGATATCTAGCCACTAGCCGGGAACCAGATTGGAAGACCGAGGGCGACATTGTAGTTAGGAAGGGACAACTGTGTGGCCTCAGGGAAGGGGTCGGAAGAGTTAGAGCTTCTTTGGGGGGTGTATGGAGTCAATCTGTTGAGGTGTGGGTGTTTCCGTCGCTGGCGATTGTTCCTGAGAGGTCAAGGGTCGCGGTGGGAGGACGCGTTCACCTCAACCACGCGGGAGGTCCCCCGCGACATCTCGCCTCGTTATTGTATACCGGCGGTGGAGAGCACGCGCAGGTTTCTTCGTCTGGTGTGATACAAGGCTTATACCCGGGAAGTACTCGAGTCAAGCTCGTGGCGGTTGATGCTGCTAACGTTGAACTGGCGAGTGCCGAGGCCGAAATTGAGGTTGTGCCTATAACCACGCTCCGAGTGCGGGCGGCGACACAAACTTTGTTAGTGGGGTCGCCCGGACCCGTGTGGATTGAAGCTGCGGGGTTGACAGCGACCGCTCTATCCTCGCTCCAACCGTTGCCTCGAGTCACTTGGGCATTACGGGACCCCACCATGGCCAGGATATACACGTCGCATATTGACGATCGCTTGGAGAGGTCGGTTATTGAAGGACTATCAATACGAGTGGTGCCCTTAAAGCCCGGTGTTATCACACTCGATGTCAGAGTAAGGAACATGGGACAAGTCGCTGAAACTCGTTCCTGGGACAGTACAATCGAGATTCTTGGTGTATCAGATATCCGCACCTCCATAGAGGGCTTGAGAGATATAAACTCTGGTGAGATGTTATCCCTGGCCGTTGGTTCCACCGTTCGTTTGAAGTCACTACCTAAAGGACGGTGGTCTTCCTATCAGGATGGTAATACATGTCGAGGTCGCTATCCCCTACTACTGTACAGTGGAGCCGGCGGAGTCTTCGGAAACGTGGGAGTCCTTGAGGGTTGTGACGAGGAGCGTCCTTGTGCTCGCGGTAGCGAGATTACTCTCAGTGGTCTGGACTCGAATGGAGCTTTTATGAGCTTCGAAAGTTCCTTAGCTGGTGTTACCGTCTCAGACGAGGTCTTCATACCTGGATCCGACGCATACGCCAATAGGATAGTGGCGACGGGAGGATCCGCTTTGTGTATCGAAGGTTCGGGTTGGACGGTTCCAGCGGGGATCCAGGCGGTCTCCGGCGCGGGGCTGACACTGGCGGTGCTGATGTCGGATGCACCTGCCATGCACGTGTTGCGACTCGATCGACCGCCATCTACCGTCAACATTCTGCAACTTCCGCTCTCTAAGATGGAATTTCTTCCCGGAGAATGGCCGGCGTCCCTCGTACCTCTCTCCATCCAAGCGGAGGGACTCACGTCGGGCCCTCTCCTGTGTACCGAAGAACAAAAGTATGCTCTAGCCGGAGTAGACGTTGATGTGCCATTTAGCTGTCGAACGGCCGCCCCGTTTGCAGCGCAAGCTGTTCTGGATATTCCCAACGGACGACACGGATGTGCCATTCTCCCAGGAAATGAAATAAACGAGGCAGTCGAGGTGGAGCTGTGTGCCGAGTGGGGCGTTTTGAGCACTTGCACTAAAGTACAGTTGTTGCCGCCGATACAATTGTCACAAACACGAGTATCACTGCTGAATCCCCCTTCTATGTTTATTATTAATGGACACCCGAACGCTCTAAAGGCGGTCAAAATTACACCGTCGCCCGGTCTGAAAGTTGAAACGAATTCCAGGGAAGGTCAAATAAGTGTAACGGTTAAATCCGAAAGTACGACCTGTGGCGTTGGATGGGTCAACGTTATATCGAAATTAACGTCTCAAGAAATCAGGGTCGAGGTCGAGAGAGAATGCGAAATAGCTTGCGGGACGTTGCTGGGCGTTTTATTTTCTATAATGAAACCCTATCTATCAACTCTAGTGACGGTCGCCGTTATAGCCGTTGGATATTTGTATGTTCAAAATCATCTACAGCAAAAAGGTCAGATACAGTTGCCAAAGCCACCGCAGACTACCCTTCAGACTCCGCTGCCAGAGCCTCGGAGTCGGACGTGGTCGCGGAGTCCCTACGCCTCCAACCCATCAGCACCAGTGTATGGTGACACCAGCATGTTACCAGATGCGAGCTTCTCGCCGACATCCACTAGAATACATTCAAGACTACTTTAG

Protein sequence:

>DPOGS215465-PA
MKSLRTSIVSVLTLLSFITNCESAKINTPRVLLPWFENLYVNFTFEIIEGGCYSWSLSRDDIIDLVPLYEDTWGHCSRAARVSVSKTCIPPGSVIILAEEVNTGEILRGDVDVDIIRSLKIMSTTRNLYLEEAPEAFEVVAYDDKGNKFSSLEGISFTWNTENVDNSGNHPLITLVQWKDTDYEAPQGIAELEAQGLQSYSVLLYGQAMGESHVTVCLDKICTDFYLHVVASVVLTPAVAYVAPGDTLRYKVVRARAGRLTVQDVAATIYRMELPQSDVATLEDGVSLVRAAEVGTSHVYLKSEATEVAMATLTVVEPYSIRVTLRPSNMIIRGEQFTVHCVVFDEDGHPLTAGPEISIRLTVNGEADVLLMKSTENGTITDAVAYNTGEFTVTAKLVSVAGKSMFKNVEGQITAKAVDPLAMVPPEMFIAWTESNLEFPLKHSGGGDEHVTWSERQDSNLVLTPAGLLTVRGVGHMDVRLHLTKYPHIQASGRVWSAVPELLQVSTSGTARVARPHHLHIKLTGTHPATGELYNFNTCNCASFAVSLVEGPEPQNVTAAPWVKPKDGACCVVECTFLFRGVSTLLVSRGRAADTARVAVRAAPSLLWPHAAALLPGATIPVIGEGESLIPQSSQPRVAELIGRDGAPPHNYPDAQLFTLKCQRKGEALVELSSLQEERESVSLETWCSSHVTRVRLDPPDTQGNCSGPRLWLRPGQEVPIKVTLFDTIGRQLLDEDGPIMQWDVQPYHIGIGVKTTDRLFVETNSKYAPVPVPDKYYQLVVATEQAIGWSGSIKAAIPDTSATIQAKVVSPLKCDPMKVHIAWEGETVPNISAVTGGSGKYIVETPKGVTASVDEGRLSAVLPTPGTYDLIVADSCVSGEKNIVEVIIEEVLSVEVSTARAVCVDNCIPIRALVKGVSHRYLATSREPDWKTEGDIVVRKGQLCGLREGVGRVRASLGGVWSQSVEVWVFPSLAIVPERSRVAVGGRVHLNHAGGPPRHLASLLYTGGGEHAQVSSSGVIQGLYPGSTRVKLVAVDAANVELASAEAEIEVVPITTLRVRAATQTLLVGSPGPVWIEAAGLTATALSSLQPLPRVTWALRDPTMARIYTSHIDDRLERSVIEGLSIRVVPLKPGVITLDVRVRNMGQVAETRSWDSTIEILGVSDIRTSIEGLRDINSGEMLSLAVGSTVRLKSLPKGRWSSYQDGNTCRGRYPLLLYSGAGGVFGNVGVLEGCDEERPCARGSEITLSGLDSNGAFMSFESSLAGVTVSDEVFIPGSDAYANRIVATGGSALCIEGSGWTVPAGIQAVSGAGLTLAVLMSDAPAMHVLRLDRPPSTVNILQLPLSKMEFLPGEWPASLVPLSIQAEGLTSGPLLCTEEQKYALAGVDVDVPFSCRTAAPFAAQAVLDIPNGRHGCAILPGNEINEAVEVELCAEWGVLSTCTKVQLLPPIQLSQTRVSLLNPPSMFIINGHPNALKAVKITPSPGLKVETNSREGQISVTVKSESTTCGVGWVNVISKLTSQEIRVEVERECEIACGTLLGVLFSIMKPYLSTLVTVAVIAVGYLYVQNHLQQKGQIQLPKPPQTTLQTPLPEPRSRTWSRSPYASNPSAPVYGDTSMLPDASFSPTSTRIHSRLL-