Monarch geneset OGS2.0

DPOGS210943
TranscriptDPOGS210943-TA2229 bp
ProteinDPOGS210943-PA742 aa
Genomic positionDPSCF300004 - 1550036-1555715
RNAseq coverage504x (Rank: top 25%)
Annotation
HeliconiusHMEL0071440.089.96% 
BombyxBGIBMGA006371-TA0.083.27% 
Drosophilaneo-PB0.059.65% 
EBI UniRef50UniRef50_Q9VAG20.059.65%CG7802, isoform A n=24 Tax=Neoptera RepID=Q9VAG2_DROME
NCBI RefSeqXP_968199.10.072.15%PREDICTED: similar to AGAP002316-PA [Tribolium castaneum]
NCBI nr blastpgi|910794820.072.15%PREDICTED: similar to AGAP002316-PA [Tribolium castaneum]
NCBI nr blastxgi|910794820.071.99%PREDICTED: similar to AGAP002316-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[383-618] IPR0015072.5e-30Zona pellucida sperm-binding protein
[209-287] IPR0030145.8e-17PAN-1 domain
[205-286] IPR0036097.9e-12Apple-like
Orthology groupMCL11823 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210943-TA
ATGCGGTTATTACTTAGTTTAGTAGTGGTGATTTGTGCTGTCGATGCAGCCAAAAGGTTCGAAGGAACTTTGAGATCATCAGCCGATGCTCCTCCTCAAGACAACCTGGCTGTTGAGTCAGGAGCACCTGAACCTGCTATCGTTGCTGCACCTCAAGAATACACTAATCCTGGCGCACCACCTCCTGAAACCCTGAAAAATGCTGAAGAAATAGAAGAAGAAAAAGAACAAGATATAGAGCCACCAGCATCCGCTCCGGAAACCGCGTCCGGTGGAGTTCCTCCGTCAGCGCCTAGTGGTATTTCTGCTCCCTCGGCACCCGCCAATTCTCTCGAAGAATGTGATCCGGAGAAAATTGGATTCGAACTGGTCACTGGATATGTATTCTCTGCGCCATCACATATTCTCGACGACATCCCCGGCACACTTATGTTGACTGATTGTTTAGAGCAGTGTCAAGCTAACGACACTTGTCGCGCCGTCAACTACGAAACTGGTCTATGCGTGCTCTTCAGCTCTGACGCCGATCAATTGCCCGGAGCTTTGACAAAATCCCAGTTTCCGGTATTCACGATCTACGCTCAGAAATCGTGTCTGGGAGTGAAGCCGTGTGAACGAGCTTGGTGTTTCGATCGCGTTCGCGGATACAATCTCAAGGGATTCGGCAAGAGAACGCATACCGTTGAATCCAGACAAATGTGCCTCGATCTTTGTCTAGGAGAAAATGAATTCGTTTGCAGATCGGCGAACTATAACAACAAAACAGGTGAATGCGTTCTGTCGAACATGGATCGTATCACTTTAGCTGGAACCAGCGCTTTCCAACCAAACGAGGATGTTGACTACTTGGAGAATAATTGTGTGGAGGAGCCTACAAAGCTTTGCGAGTTTAAAAAGATGAACGGACGCATTCTCAAGACGGTGGACTCGGTGTATCAGGATGTCCAAACGATCGAGGAATGTCGTGAATTGTGTCTCAATTCGCCTTTCCGTTGCCATTCTTATGATCACGGGGACACGGGAGATCATGTGTGCCGTCTTTCCCACCATTCAAAAGCCACGCTCGCTGATATCCAGGATCCCTACTTGGAAGTACCCGAAGCTGCCACTTATGAACTTTCTTCGTGCTACAATGTATCCATTGACTGTCGGGCAGGTGACATGGTAGCTCGAATTCAGACATCTAAATTGTTCGATGGAAAAATTTATGCAAAGGGAAGTCCCAATTCATGCGTTGTCGATGTTAAACAAAGTCTGGAATTCGAACTTCATATGGAATATAATAATATCGATTGCAATGTTAAGCAAAATGGACTTGGAAGATATCTGAATGACGTCGTTATTCAACATCACGACACTATCGTTACTTCTTCTGATCTTGGTTTAGCGGTAACTTGTCAATATGACTTGACCAACAAGACTGTAGCTAATGAAGTCGACCTCGGAATTCAGGGTGAGATCCAGACAGGATTAACAGAGGAAGTTATTGTGGACTCACCCAACGTAGCCATGAGAATTACTGATAGAAGTGGAGACGACACTATTGTTTCTGCTGAAGTTGGAGATCCATTGGCACTTCGTTTCGAAATCATGGATCAAAACTCACCATTCGAAATTTTTGTTCGAGAACTTGTCGCAATGGATGGCGTCGACTCCAGTGAAATTACTCTCATCGATAGCTATGGTTGCCCAACTGATCATTTTATCATGGGACCCCTCTATAAATCTACTGCAAGCGGAAAGACCCTGCTTTCACACTTTGATGCGTTTAAGTTCCCATCATCAGAAGTAGTACAATTCCGCGCCTTAGTGACACCCTGTATGCCGACTTGCGAACCCGTTCAATGTGACGGAGGTCCAAATGAATTGCGCACAGTTTCATCATATGGACGTAGGAAGAGACGTTCGACAACTCCCACTGACGATATGCTTCTCGTCCAGACTATTCAAATCACCGACAAGTTCGGTTTCGACAAACAGAAAGCAAAGAACGTCACCGAAGACAGCGTTTACATCAGAGAGAGCGATGCTACGTGTGTTAATGCTGCTGGTGCTTTATTGGCTGGAGCGGCATTCATTGCCGTACAGTTAGTGGTATTGGCTGCATGGACCTGCAGTTGGCAGCGCCGACGAGCAGCTGCTAAAGCTGAGCTCCTGCCTGGACCAAACCCTAATTCACTCTGCAAAGTCTATGATGCCGGTTTCTCCCGCGCCCAGAGGCACTTCTGA

Protein sequence:

>DPOGS210943-PA
MRLLLSLVVVICAVDAAKRFEGTLRSSADAPPQDNLAVESGAPEPAIVAAPQEYTNPGAPPPETLKNAEEIEEEKEQDIEPPASAPETASGGVPPSAPSGISAPSAPANSLEECDPEKIGFELVTGYVFSAPSHILDDIPGTLMLTDCLEQCQANDTCRAVNYETGLCVLFSSDADQLPGALTKSQFPVFTIYAQKSCLGVKPCERAWCFDRVRGYNLKGFGKRTHTVESRQMCLDLCLGENEFVCRSANYNNKTGECVLSNMDRITLAGTSAFQPNEDVDYLENNCVEEPTKLCEFKKMNGRILKTVDSVYQDVQTIEECRELCLNSPFRCHSYDHGDTGDHVCRLSHHSKATLADIQDPYLEVPEAATYELSSCYNVSIDCRAGDMVARIQTSKLFDGKIYAKGSPNSCVVDVKQSLEFELHMEYNNIDCNVKQNGLGRYLNDVVIQHHDTIVTSSDLGLAVTCQYDLTNKTVANEVDLGIQGEIQTGLTEEVIVDSPNVAMRITDRSGDDTIVSAEVGDPLALRFEIMDQNSPFEIFVRELVAMDGVDSSEITLIDSYGCPTDHFIMGPLYKSTASGKTLLSHFDAFKFPSSEVVQFRALVTPCMPTCEPVQCDGGPNELRTVSSYGRRKRRSTTPTDDMLLVQTIQITDKFGFDKQKAKNVTEDSVYIRESDATCVNAAGALLAGAAFIAVQLVVLAAWTCSWQRRRAAAKAELLPGPNPNSLCKVYDAGFSRAQRHF-