Monarch geneset OGS2.0

DPOGS204485
TranscriptDPOGS204485-TA1797 bp
ProteinDPOGS204485-PA598 aa
Genomic positionDPSCF300002 + 1121280-1131408
RNAseq coverage812x (Rank: top 16%)
Annotation
HeliconiusHMEL0156931e-10875.30% 
BombyxBGIBMGA007829-TA0.091.83% 
Drosophilatyn-PB0.075.00% 
EBI UniRef50UniRef50_Q8MS370.075.00%RE15579p n=39 Tax=Pancrustacea RepID=Q8MS37_DROME
NCBI RefSeqXP_394451.20.078.43%PREDICTED: similar to SP71 CG17131-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838484280.078.22%PREDICTED: uncharacterized protein LOC100876152 [Megachile rotundata]
NCBI nr blastxgi|3838484280.078.49%PREDICTED: uncharacterized protein LOC100876152 [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[255-480] IPR0015073.9e-36Zona pellucida sperm-binding protein
[12-95] IPR0030141.2e-15PAN-1 domain
[13-94] IPR0036091.4e-08Apple-like
Orthology groupMCL15873 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204485-TA
ATGGTTAAAATTCAAGTGAGGTCAGAGAATGTGTGTATGAGGCCCTGGGCGTTTGAACGTGTACCCGGCAAGGCTCTAAGAGGCCTCGACAACAGTATTATCTACACCACCACCAAGGAGGCTTGTCTTGCTGCTTGCCTTAATGAGAAAAAGTTTCCGTGCCGTTCAGCGGAGTACGAGTATGGCAGTATGAGATGTTCTTTGAGTGATTCAGATCGTCGCACCGGACAGCATTTTGTACAACTAGTCGACACACCCGGCACTGATTATTTCGAGAATCTATGCCTGAAGGCGTCCCAGGCGTGCAAAGGAGCGAGGGTATTCACGGCACCACGCGTGGGCGTTGCTGAAGACAAAGTGGCACAATATGCTGGCTTGCATTATTATACTGATAAGGAGCTACAGGTAACGTCGGAGTCAGGATGTCGACGTGCTTGTGAGATAGAATCAGAGTTCCTGTGCCGTTCCTTTTTGTACCTCGGAGCGCCGCATTCGTCCATCTACAACTGCAGGCTGTACCACCTCGATCACCACACGCTACCTGATGGGCCCTCAGCTTACCTGAACGCTGAACGTCCGCTCATCGACGACGGCGAACCGATCGGCAAATACTTCGAGAACTTCTGTGAAAAACCACCAGCCAATCCCAGTGGAGAGCTGCCTGTTACTATAGACCATCAACAGGATGTCAACATGTCCAGCAACTTAACAAGAAACGATGCGAACTGTGACAAGACCGGAACTTGCTATGACGTATCCGTCCACTGCAAAGATACCAGGATCGCGGTACAAGTCCGTACGAACAAGCCTTTCAATGGAAGAATCTATGCACTAGGCCGCTCGGAGACATGTAACATAGACGTAGTTAATAGCGACCTATTCAGACTTGATCTCACAATGGCCGGTCAGGATTGTAATACCCAGAGCGTCACTGGCGTTTATTCAAACACTGTAGTATTGCAACATCACAGCGTTGTTATGACGAAAGCGGACAAAATCTACAAAGTGAAGTGCACATACGACATGAGTTCGAAGAACATTACATTTGGAATGGTGCCCATCAGGGATCCGGAGATGATCTCCATCACTGCAGCACCTGAGGCACCTCCACCGCGCATTCGCATCCTTGATAGCCGACAACGCGAGGTTGAAACTGTCCGTATTGGAGACAGACTCACCTTCCGTATCGAAATTCCCGAAGATACTCCATACGGCATTTTTGCACGCAGTTGTGTCGCTATGGCTAAGGATTCTAAGAGCACGTTCCAGATCATCGACGACGATGGATGTCCAGTCGATCCATCAATATTCCCAGCATTCAATCCCGACGGTAACGCATTGCAGTCCGTGTATGAAGCCTTCAGATTCACCGAATCTTACGGTGTTATATTCCAGTGCAATGTGAAATACTGTCTGGGACCATGTGAACCTGCGGTTTGTGAATGGGGCAGAGAATCAATAGAGTCATGGGGCAGAAAGAGACGTTCTTTACCTAACAACGAAACCAGTGAAACTCATTCTCAAGAGGAAGACATGAATATTTCTCAAGAAATATTGGTTCTTGACTTTGGTGATGAAAGACAGAGTACTGACTTCCTCCGATCCGATAAACCTGGTGGCTCAGCATCAGAGACCAACTTTGGAGAAAAAACAGTAACCATTGTGGAGCCATGTCCTAGCAAGTCATCCGTGCTGCTGCTAGGAGTTGCCTGTGCCTTACTGGTACTTTTGTACATCGCGACCATCTTCTGTTACTACATGCGTAAATGGCTGGCGCCACCTAAGCATTTGTCGTAA

Protein sequence:

>DPOGS204485-PA
MVKIQVRSENVCMRPWAFERVPGKALRGLDNSIIYTTTKEACLAACLNEKKFPCRSAEYEYGSMRCSLSDSDRRTGQHFVQLVDTPGTDYFENLCLKASQACKGARVFTAPRVGVAEDKVAQYAGLHYYTDKELQVTSESGCRRACEIESEFLCRSFLYLGAPHSSIYNCRLYHLDHHTLPDGPSAYLNAERPLIDDGEPIGKYFENFCEKPPANPSGELPVTIDHQQDVNMSSNLTRNDANCDKTGTCYDVSVHCKDTRIAVQVRTNKPFNGRIYALGRSETCNIDVVNSDLFRLDLTMAGQDCNTQSVTGVYSNTVVLQHHSVVMTKADKIYKVKCTYDMSSKNITFGMVPIRDPEMISITAAPEAPPPRIRILDSRQREVETVRIGDRLTFRIEIPEDTPYGIFARSCVAMAKDSKSTFQIIDDDGCPVDPSIFPAFNPDGNALQSVYEAFRFTESYGVIFQCNVKYCLGPCEPAVCEWGRESIESWGRKRRSLPNNETSETHSQEEDMNISQEILVLDFGDERQSTDFLRSDKPGGSASETNFGEKTVTIVEPCPSKSSVLLLGVACALLVLLYIATIFCYYMRKWLAPPKHLS-