Monarch geneset OGS2.0

DPOGS207532
Transcript: DPOGS207532-TA | 4206 bp
Protein: DPOGS207532-PA | 1401 aa
Genomic position: DPSCF300177 + 428657-439193
RNAseq coverage: 55x (Rank: top 69%)

Annotation
Heliconius: HMEL021914 | 0.0 | 78.81%
Bombyx: BGIBMGA001896-TA | 0.0 | 80.83%
Drosophila: nompA-PA | 0.0 | 53.14%
EBI UniRef50: UniRef50_UPI00022C97C7 | 0.0 | 57.70% | UPI00022C97C7 related cluster n=2 Tax=unknown RepID=UPI00022C97C7
NCBI RefSeq: XP_001120394.1 | 0.0 | 58.25% | PREDICTED: similar to no mechanoreceptor potential A CG13207-PB, isoform B [Apis mellifera]
NCBI nr blastp: gi|307213317 | 0.0 | 59.06% | hypothetical protein EAI_13009 [Harpegnathos saltator]
NCBI nr blastx: gi|307213317 | 0.0 | 59.83% | hypothetical protein EAI_13009 [Harpegnathos saltator]
Group:
KEGG pathway:
InterPro domain:
  [928-1164] | IPR001507 | 2.4e-25 | Zona pellucida sperm-binding protein
  [354-427] | IPR003014 | 4.4e-12 | PAN-1 domain
  [350-428] | IPR003609 | 4.4e-11 | Apple-like
Orthology group: MCL12332 | Insect specific

Nucleotide sequence:

>DPOGS207532-TA
ATGAAGCCGCGTGGGGCGAGTGTGATGCACGCCTTCACTGCTCTAACCATGCTGACTATGGCTAACGCCCAAACCACCTGTAATCAAGGGATGGGAAGGGTTATGTATGAACGACTACCGAACCAACAGCTCCACGGATTCGATGACGACGTTATACGAGAAACCGCACCGCCTTTCAGAGTCTTAGAGAAATGTCAGGATTTGTGTTTGCGGGATCGCTCTGGTAACAGTCTTGTACGGACTTGTAATTCGATAGATTTTCAACCAGGGGCTCGTATAGCGGCCTTTAGCCCGGAGCCGGAGTATGAGGAATCAACTTGTTATCTGACAAGGGAACAAGCAGCGCCTGAAGGCATCGGGACGCTCATGATTGTACCAAATAGCGTCCATTTCAACGAGATTTGCTTGACCTCTAATCGTCCTGAACGTGAATGTCCATCACGTCGCTACGTTTTCGAACGACACGCTCGTAAACGATTGAAGCTGCCACCATCAGATCTTAAAGAGATCATGGTGGCTAATAGGACCGAGTGTGAGGACAAGTGTTTGGGAGAGTTCAGCTTCGTTTGTAGATCGGCAACTTACGACACTGCTTTGAGAACTTGCTCCTTAAGCAGGTTCACTAGGAGAACCCACCCTGAACTTCTGGAGGATGACCATAATGCTGATTATTTGGAAAATACATGCCTTAATGCTGAACGTCGTTGTGATGGTTTGGCCGTGTTTATTAAAGAAGAGAATAAACGGCTCGGTGGACCTTTTGAAGCGGATGTTTTCTCAAATATGACACTTGATGAGTGCCAGTCTATGTGTGTCAGAGCGGAGAAATATTTCTGTCGTTCTATCGAACACGACGCTATGACAAGACAATGTGTCCTCTCAGAGGAAGACTCGGTTTCCCAAAAGGATGACGTGACTGTTAGCGCGTCACCCACACACCACTTCTATGATCTTGTTTGCTTAGATAATCTCTCTCATCGGGTGACAGCTCGCGGGACCGAGTACCCGGACAACAGCGTGACGTCACACCTGTTCTCCCCGGGCCGGCGCCCTGACACCGCCTTCCAACGGTATAGAAACAGCCGCATCACTGGAGAGTTCCATTCAGAGATCACTGGCCGATCTTTAAGCGAGTGTCTCGATGAGTGCTTGAGGCAGACTAGTTTTCAATGCAGGTCAGCGGTATACAGTGATCGCGCCAGAACCTGCCGTCTCAGCAGATATAACCAAAAAGATGGAATGCGCCTCTTGTATGATCCAGATTTCGATTACTACGAGAACCTTATGCATCAACTAGTTAGTGGAGAAAGCGAGACTGGGGGAACGGGCAGTGGTTCGAAATCATCGTGGGGTTCGAATACATCAGGTGGTAGTAATGGCGGCGATCGTGATAGATATCCGCCAGGTGTAGATCGTGACAGATACCCGGATGATGACCGATACTATGAGGACGATCGGTATCCGTCACGACCGGACAGGTATCCAGACGACCGATATCCGGCCGGGCCGGATCCTTACCCGGATAGATATCCGGGCGACCGCGACCGGTATCCAAGCGACAGATATCCGGAAGATAGATATCCGGATGACAGATATCCACCTGGTGTTGGACCCGACAGATTCCCTGAACGATATCCACCACCTATGGATCGGTATCCACCAGGAGTGGATAGATATCCTAGTAGATTCCCAATAGGGGACCGATACCCCATTGACGGTGGACGGTTTCCTATTTTCGATCAACCAAATGACATCGGCAGATACCCCGCTTCAAATCGCTACCCTGCCAACAGATATCCGATCGACCCGTATCTTGACCGATATCCGGAAAGAGATCCAGTAGACAGATACCCGGTTTCTGACAGGGGTCGATTCCCACGTCCGTGGTCTAACAGGTATCCTGAGAGGTTCCCACAGGACCGATATCCTAGTACTAACGATTGGGGACGCTATCCGTTCAGTAGGTACCCAGTGGGGGTAAATCGAGATCCCATTCCATT
AGGAGGAGATCGTTACCCAGATTTGTATGATCGATATCCGCCGACCGGATCGTCATACCCTGGCTACGGACGGGGTGGTTATTATGGTTCAGAATATCCTGGAGGTGACAGGATCCCACCTGGTGACAGATATCCTGATAGAGGGGTTCGTCCAGTACCTGACAGATATCCTGTAGATCCCAGACCACCAATAGGAGTAAGGAGACCAATAGAAAGACCATTGGACAGGCCAGGAGGTTACCGTGGAAGCTATCCTGCTCCGGGTTTAGATAGACCTCTCGGCGGTTATGGAGGATACGAGCCTGATGGACCCATACCACCATTAGGTGCACCGGTAAATGTGCCAATTGGCGGTGCTATTGCTCCTATTGGGGGTCATATAGGCGGCCCAATAGGGGCTCCATTAGGAGGTCCTATTGGGGGAAGACCTCCGATTGGTCAGCGACCATGGCAAGGATCTAGATGTGAAGATGACAGTTTCAGACAAGTTGGTAGACAACGAATGCAAAGACGATTTGTTAGACGTTTCACAACAGCTCAGTCACTGGCCCATTGCCAAAGAGAATGTATCGAAGCTCGAGATTTCATATGTCGGTCATTTAATTACAGGGATGTAGGTTTCGGAGTTGAACCTCGGGATAACTGTGAACTGAGCGATCGCGATACTCGGGAGTTAGATGCTGCGAATCCAGCACATTTTGACAACACTGCAAACGAATATGACTTTTACGAAAGATCACTTGGACGCATGAATGAAGAATGTTTAGATGTGTCACAAGTTTGTAATGAAGATGGAATGGAATTCACTCTTCGATTGCCAGAAGGGTTCTTTGGTCGTATGTACACTTATGGTTTTTACGATCGCTGCTTCTTCCGCGGCAACGGAGGCGTGTCTAATGTACTCCGAATAACTGGGGCTCATGGTTATCCCGAATGTGGTACCCAGCGATATGGAGACACAATGACCAACATCGTCGTAGTACAATTCAGCGATAATGTTCAGACGTCAAGAGACAAGCGCTTCAATCTAACTTGTTTGTTTAGAGGTCCGGCGGAAGCCGTCGTTACTTCCAATTACATCGGCGCTGGGTCGGGAAGTCCTATACCCATTGAGTACTTGCCAGAGGAGAGTTCATTGAACTCCAAAGTGCGTCTGCTCATCCTTTACCAAGGGCGTCCAACAACTACTATAGCGGTCGGCGATCCTCTTACATTCAGACTTGAAGCACAGGATGGATATAATTACGCCACAGACATATTCGCCACAAACGTGATCGCCAGAGATCCATATTCAGGACGTTCTGTGCAGCTTATCGACAGGACTGGATGTCCCGTGGATCCGGATGTCTTCCCTGAACTAGACAAAGGACGTAACGGTGACTCTTTGGAAGCTCGTTTCAATGCTTTTAAAATACCCGAATCGAATTTCCTTGTGTTTGAAGCTACCGTGAGAACATGTAGGGATGGATGTCAGCCTGCGTATTGCCCGAGTCACTCTGGTAGATCGGAACCCTCATTTGGACGTCGACGTAGAGATTTAAACTCGACCACAGACGTCGGCAACAGCACGGATGCAGTGAGGACAGAAGATAACACCAATACAACTGATAATATAAGCGAAGCGTCCGTTTACAAAATATCTTACGAAGAGGCGACGGTAGATAAATATTTGAAAAATGATGTGGAAACGCCAAGCCACGTACGGAAGATGATTGAAGTGTTCGATAATCGAAACGAGCTGATAGAGGAAAATGGTCCTGATTCGTCTCCAGTAGTGGCCGCAGTGGAGCCCTGCGCGTCGCCAGCTCACCAGCGTATGCTGCTTGTTGCTTTATGCACAGTGCTAGCACTCTTATTGGCTATACTACTCGCTGCGCTTTATATTTACAAGCGCTACTGGCGTATAGTTCGCAAGAACATGTCAGTGACAGCCCCAGTGACTCGTGCTGCAGCACCCGTTCCTAGACATACAAGACCGTCTCTCTTCTCTGCCTCACATT
TACATAAACCGTTTTCTTTAAGTGGTTTGGGCAGAAATTTCTCTGAGGTGGGCGAGGAGGCCGGGTCATCCGGACGTCTGGCAACCGCGTTCGATGACGGAAGCGAACCAATCTATACAGACCCCTCGCTGTTTGAACGATCTAGGTCACTCCGGTCCTTACATTCCCTGGACATGAAACCCGAGCGTCGCGAACATCGTTCTTAG

Protein sequence:

>DPOGS207532-PA
MKPRGASVMHAFTALTMLTMANAQTTCNQGMGRVMYERLPNQQLHGFDDDVIRETAPPFRVLEKCQDLCLRDRSGNSLVRTCNSIDFQPGARIAAFSPEPEYEESTCYLTREQAAPEGIGTLMIVPNSVHFNEICLTSNRPERECPSRRYVFERHARKRLKLPPSDLKEIMVANRTECEDKCLGEFSFVCRSATYDTALRTCSLSRFTRRTHPELLEDDHNADYLENTCLNAERRCDGLAVFIKEENKRLGGPFEADVFSNMTLDECQSMCVRAEKYFCRSIEHDAMTRQCVLSEEDSVSQKDDVTVSASPTHHFYDLVCLDNLSHRVTARGTEYPDNSVTSHLFSPGRRPDTAFQRYRNSRITGEFHSEITGRSLSECLDECLRQTSFQCRSAVYSDRARTCRLSRYNQKDGMRLLYDPDFDYYENLMHQLVSGESETGGTGSGSKSSWGSNTSGGSNGGDRDRYPPGVDRDRYPDDDRYYEDDRYPSRPDRYPDDRYPAGPDPYPDRYPGDRDRYPSDRYPEDRYPDDRYPPGVGPDRFPERYPPPMDRYPPGVDRYPSRFPIGDRYPIDGGRFPIFDQPNDIGRYPASNRYPANRYPIDPYLDRYPERDPVDRYPVSDRGRFPRPWSNRYPERFPQDRYPSTNDWGRYPFSRYPVGVNRDPIPLGGDRYPDLYDRYPPTGSSYPGYGRGGYYGSEYPGGDRIPPGDRYPDRGVRPVPDRYPVDPRPPIGVRRPIERPLDRPGGYRGSYPAPGLDRPLGGYGGYEPDGPIPPLGAPVNVPIGGAIAPIGGHIGGPIGAPLGGPIGGRPPIGQRPWQGSRCEDDSFRQVGRQRMQRRFVRRFTTAQSLAHCQRECIEARDFICRSFNYRDVGFGVEPRDNCELSDRDTRELDAANPAHFDNTANEYDFYERSLGRMNEECLDVSQVCNEDGMEFTLRLPEGFFGRMYTYGFYDRCFFRGNGGVSNVLRITGAHGYPECGTQRYGDTMTNIVVVQFSDNVQTSRDKRFNLTCLFRGPAEAVVTSNYIGAGSGSPIPIEYLPEESSLNSKVRLLILYQGRPTTTIAVGDPLTFRLEAQDGYNYATDIFATNVIARDPYSGRSVQLIDRTGCPVDPDVFPELDKGRNGDSLEARFNAFKIPESNFLVFEATVRTCRDGCQPAYCPSHSGRSEPSFGRRRRDLNSTTDVGNSTDAVRTEDNTNTTDNISEASVYKISYEEATVDKYLKNDVETPSHVRKMIEVFDNRNELIEENGPDSSPVVAAVEPCASPAHQRMLLVALCTVLALLLAILLAALYIYKRYWRIVRKNMSVTAPVTRAAAPVPRHTRPSLFSASHLHKPFSLSGLGRNFSEVGEEAGSSGRLATAFDDGSEPIYTDPSLFERSRSLRSLHSLDMKPERREHRS-
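
The transcript and protein records above can be cross-checked against each other: 4206 bp equals 3 x (1401 aa + 1 stop codon), and the protein sequence ends in "-" where the transcript ends in TAG. The following is a minimal sketch of that check, assuming the standard genetic code and a coding sequence that starts at the first base. The function name `translate` and the table construction are illustrative and not part of the geneset; stop codons are written as "-" only to match the convention used in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies
# fastest). Stop codons are rendered as "-" to match the protein record above.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY--CC-W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    # Remove line breaks and spaces so a sequence wrapped over several
    # lines (as in the FASTA block above) is handled as one string.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# Lengths stated in the record: 1401 residues plus one stop codon.
assert 4206 == 3 * (1401 + 1)

# First and last codons of DPOGS207532-TA, copied from the sequence above.
print(translate("ATGAAGCCGCGT"))   # MKPR
print(translate("CATCGTTCTTAG"))   # HRS-
```

Passing the full DPOGS207532-TA sequence to `translate` and comparing the result with DPOGS207532-PA would extend the same check to the whole record.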