Monarch geneset OGS2.0

DPOGS213604
TranscriptDPOGS213604-TA1197 bp
ProteinDPOGS213604-PA398 aa
Genomic positionDPSCF300033 + 722409-733301
RNAseq coverage95x (Rank: top 62%)
Annotation
HeliconiusHMEL0136735e-9878.45% 
BombyxBGIBMGA011666-TA0.081.43% 
DrosophilaCG13196-PA5e-1126.67% 
EBI UniRef50UniRef50_E2C9Q52e-13557.18%Putative uncharacterized protein n=9 Tax=Neoptera RepID=E2C9Q5_HARSA
NCBI RefSeqXP_001809747.18e-15162.62%PREDICTED: similar to AGAP009011-PA [Tribolium castaneum]
NCBI nr blastpgi|1892352121e-14962.62%PREDICTED: similar to AGAP009011-PA [Tribolium castaneum]
NCBI nr blastxgi|1892352123e-14462.62%PREDICTED: similar to AGAP009011-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[126-272] IPR0015071.5e-09Zona pellucida sperm-binding protein
Orthology groupMCL17020 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213604-TA
ATGACGGGCGCCACCAGCTGTCTTATAGTGCTGCTGCTTTTGGTTCCGGCCATCTCAGCACAAGATTACGACGTGACTGATATCCAATGCACATTCGCGACCACGGGCTCCGGTATAAGGGACTCTGTATCAGCATTGTTAAGGAAGCCCGAGGGGTTCCGCGGGGCTCCGTTGTTCGCTGACGACCGCGCTACAGACCCTATCTCAGATTCCGGTTTCGTGCACGTTCGCATCTGGTTCCCCCAGTTCCCGGGGGTGGTGATGCAATCAGACCAGGAACTGATCATCATGTGCAAGCCCCCCGAGCCCACCATCATCGAGAACAAGGCAGCAGGATTTGCGGGTAGCTTTCCGCACGGCGCTCGCGTTTCCGGCGTCGTCGAAGAAACTCCGGGCCGTCTTGAGTATGAAGTAGCGCTGTATAAGGAGGCGCCCCCTGTGTCCCGACACTCAAACCACTCATTGGATATGCCTGTTGACCAGGCTGTTCCAATCGGAACTAAATTACAATTAAGAGCACGCATCAACCCGGATTCAGCCTGGCGACATATCAAACTCCTAGAGGTCGCTGTGTCCCCCGACCCTGATAGACCTCACGCTAATGGAGCCGTGTTACTCGTGAAAGACGGCTGCCGGAACAGAGATTTCGCATCTATCATACCACACCAGCCGGCCAGGTACAGGGAGCGTCATAACGAAGTTTTTTTGGACTTCGAAGCGTTCCTCTTGGCTTCCATGAAGGAGCGTTCCACTTTATGGATCCACTCACAGATCAAGGCGTGTATGGACGCAGCTGACTGTCAACCGGACTACTGCCTCGACTTATATGAACCGTCAGGTCACGGTCGTCGTAGAAGATCGCTGCCAGAAAACGAGACAAAGACTATCTCAGACAGCCAATATACGCGGTTCAAAGAGAATCTGGAGTACTCGGTGGTGATGCCGGGGGAGTTGTTCCACAAAAAGTCTTTGGAGGCGACGTGTGCCACCTCCATGATGGTCGCGGTCGCCCTCGGAGCTCTGCTCTTCATGTCCGCCTTATTGATGTGCTATCTCGCTACTAAGTTGAATTCAACGATGCTCAAAAACAGCAGTCTTCAAACGCCAACTGGGAAAGGATTTGAACAAATATTAAGAGAACTGGCGCATCACTCACTCCCTGATACGGGCTACACGGGTCGCCCCACCGTACAATAA

Protein sequence:

>DPOGS213604-PA
MTGATSCLIVLLLLVPAISAQDYDVTDIQCTFATTGSGIRDSVSALLRKPEGFRGAPLFADDRATDPISDSGFVHVRIWFPQFPGVVMQSDQELIIMCKPPEPTIIENKAAGFAGSFPHGARVSGVVEETPGRLEYEVALYKEAPPVSRHSNHSLDMPVDQAVPIGTKLQLRARINPDSAWRHIKLLEVAVSPDPDRPHANGAVLLVKDGCRNRDFASIIPHQPARYRERHNEVFLDFEAFLLASMKERSTLWIHSQIKACMDAADCQPDYCLDLYEPSGHGRRRRSLPENETKTISDSQYTRFKENLEYSVVMPGELFHKKSLEATCATSMMVAVALGALLFMSALLMCYLATKLNSTMLKNSSLQTPTGKGFEQILRELAHHSLPDTGYTGRPTVQ-