Monarch geneset OGS2.0

DPOGS210753
TranscriptDPOGS210753-TA1710 bp
ProteinDPOGS210753-PA569 aa
Genomic positionDPSCF300013 + 1032060-1108354
RNAseq coverage21x (Rank: top 79%)
Annotation
HeliconiusHMEL0174770.091.16% 
BombyxBGIBMGA006288-TA0.089.53% 
Drosophilady-PA2e-16563.25% 
EBI UniRef50UniRef50_E0VTZ32e-16764.38%Cutilin-1, putative n=2 Tax=Neoptera RepID=E0VTZ3_PEDHC
NCBI RefSeqXP_002429587.13e-16864.38%cutilin-1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420182416e-16764.38%cutilin-1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420182419e-17656.20%cutilin-1 precursor, putative [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain[169-420] IPR0015075.1e-38Zona pellucida sperm-binding protein
Orthology groupMCL15820 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210753-TA
ATGATGCAAGTGCACATGTCGGTTCTAAACTTACCACATCCTCAGAGCGGATCTTATAGTGATCAAACGACAAATGAAGGGGGCGGCGGTGGAGGTTCACCTAATTCCGACACCAACAGCGTTGCAGAACCCTCTAGCGACCAACTGGCGATGGAATCATCAGAACGAGAAACCCCAAGTCACGCTTACAACGGTCCACCGCCCCCAGTGCCACCTCCTCACGTTACAAATCGAAGACAAAATGGGCCCCACCACCAGTCACACCATCCAATAGGTATGCCTCGTATTTCTAATCATCAGCAAATTCATTCAAAGCCATTCGCCCTTGGACCTCCTGTGAATCATCATAAAAATGACATTGGACCTGGATTCCGAGGGCCGCCACCACCCCAAGCTCCACCAAGTGACGCTCAAGCCTCGGCCAGTGACAAGGTCTACAGCACTACTGGTGACGTTTGGCCTGCTCCAGCTCCTGATATGCCGAAAATTTTATCCCTCGACGTCAAATGTGAAAAGAATGCCATGAGAGTATTTCTCAGTTTCGACAAACCATTCTTTGGTATCGTTTTCTCCAAAGGTCACTATTCTAACCATCAATGTGTTCATCTTCCACCAAATTTAGGCAGATCTTCGGCCTCCTTTGAAATTGGTGCGCACGCATGTGGCACAGCTGGAAGCGGAGATCCAAGATACAGGAGCGATGTTGCAGCAGCTGGCACGTACTTTGAAAATGTTATTGTCATACAGTATGACCCGCAAGTTCAAGAAGTTTGGGATCAAGCACGCAAACTGCGGTGCACGTGGCACGACCAGTATGAAAAAGCTGTTACCTTCCGCCCCTTCCCCGTAGATATGTTGGATGTGGTTCGTGCTGACTTTGCTGGAGATAATGTGGGATGCTGGATGCAAATACAAGTTGGTAAGGGTCCTTGGGCTTCTGAAGTATCCGGATTAGTTAAAATAGGTCAGACTATGACTATGGTATTAGCGATTAAAGATGATGATGCAAAGTTTGATATGTTAGTTCGTGATTGTGTAGCTCACGATGGTCAACGCGCCCCTATACAATTAGTCGATAGGCGTGGCTGTGTAACTAGACCAAAACTAATGTCGAGGTTCACAAAGATAAAGAATTTCGGAGCTAGCGCATCAGTGCTCTCATACGCGCATTTTCAAGCTTTCAAATTCCCAGACTCCATGGAAGTACATTTCCAATGTACTATTCAGATTTGTAGATACCGATGCCCCGAACAATGTACTGATGCACCTCATAATGTTATTGGCCCTCACGCTGAATACGGACCACCACAAATTGATCAGTCATATCCCGTAAGTGTTGAAATAAGGAGGGATGAAAGAAGAGTAAGGAGGCAACGTAGAGCCACATCACCTGAAAAGGAAGTCGGCGTTAACAGAGTCATCCGAGTTGTGTCCGCTGGAGACCTGAATTTAGATAATAATGAAGAATCGATCACTCCCAAAATTGTTCCGACGCCAGGACTTGTTTGCATGACAACGCCAGGATTTGCTGCAACACTTGGAACACTTCTTGCCACTCTTATATGTTCGTGCGCAGTGTCTGCAGTATTATTCTTTAAATTACGTCCCATTACAAAACTTAAAAAGAAAACTGCTGCTATTAGCACGATACCACGACCACACCCAACAACAGGACCACATGTCATTTCGAAAAGCCGATTTTATTCGTAA

Protein sequence:

>DPOGS210753-PA
MMQVHMSVLNLPHPQSGSYSDQTTNEGGGGGGSPNSDTNSVAEPSSDQLAMESSERETPSHAYNGPPPPVPPPHVTNRRQNGPHHQSHHPIGMPRISNHQQIHSKPFALGPPVNHHKNDIGPGFRGPPPPQAPPSDAQASASDKVYSTTGDVWPAPAPDMPKILSLDVKCEKNAMRVFLSFDKPFFGIVFSKGHYSNHQCVHLPPNLGRSSASFEIGAHACGTAGSGDPRYRSDVAAAGTYFENVIVIQYDPQVQEVWDQARKLRCTWHDQYEKAVTFRPFPVDMLDVVRADFAGDNVGCWMQIQVGKGPWASEVSGLVKIGQTMTMVLAIKDDDAKFDMLVRDCVAHDGQRAPIQLVDRRGCVTRPKLMSRFTKIKNFGASASVLSYAHFQAFKFPDSMEVHFQCTIQICRYRCPEQCTDAPHNVIGPHAEYGPPQIDQSYPVSVEIRRDERRVRRQRRATSPEKEVGVNRVIRVVSAGDLNLDNNEESITPKIVPTPGLVCMTTPGFAATLGTLLATLICSCAVSAVLFFKLRPITKLKKKTAAISTIPRPHPTTGPHVISKSRFYS-