Monarch geneset OGS2.0

DPOGS208550
TranscriptDPOGS208550-TA2211 bp
ProteinDPOGS208550-PA736 aa
Genomic positionDPSCF300064 + 1039726-1050713
RNAseq coverage602x (Rank: top 21%)
Annotation
HeliconiusHMEL0166036e-16250.08% 
BombyxBGIBMGA010324-TA1e-10662.01% 
DrosophilaCG17111-PB2e-6426.70% 
EBI UniRef50UniRef50_D6X3324e-7730.18%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X332_TRICA
NCBI RefSeqXP_969426.18e-7830.18%PREDICTED: similar to CG17111 CG17111-PA [Tribolium castaneum]
NCBI nr blastpgi|2700138692e-7630.18%hypothetical protein TcasGA2_TC012533 [Tribolium castaneum]
NCBI nr blastxgi|2700138696e-7930.03%hypothetical protein TcasGA2_TC012533 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[309-564] IPR0015071.4e-19Zona pellucida sperm-binding protein
[223-290] IPR0030142.9e-06PAN-1 domain
Orthology groupMCL17004 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208550-TA
ATGATGCAACGGAGAATTTTCGTGATCTTGTCTATATTTACATTTATTACAAGTAGCGAATCACAATGTCGTCTGCCGGAATCAGAGACCTTCGTGAGATGGTCAGGTATGACTCCGGACCAGGCAACACCTGTATTTATGTACTCCGCTAGTGGGGAAGAAGAATCTCTAACAGGAGCCTGCTTGAGCCGATGCCGCGAGCTCGCAGATTGTGCTGCTGTCATCATTGTTTATACTAAAGGAAGCTGTCAGGGCATTTCTAGTTCACGGACAAATCAGCTGCGACCGGATAACGAAGTCGCTTATTTTAACAAAGTATGCCTAAAATTGCCAGACAATTGTAAGAAACTTTGGTGGGCGCTAGAGAGTACTCCAGGATATTATTTGAATTCTGATGGACCTGATGTCAAGGTCGTTGTAAACTCAACAGTTCAGGATTGCTACAATGCGATGTTTTCTACAAACGAAAAACGATACCGATCGGCGCAATGGATCGAACCTGGAAGTCCGTTAGACAGTTATTTTGTAGCACAAAAAGAGATAGGGAATTGTATATTAAATGGAGAAAATAAGTTCACTGAACCCGAGTCCTACCGAGTGTCAAACTCGTATACCTTCTACATAGAGAATCAGTGCTCCCATGATTATCCAAAAAAAATTGACAGATGTTCGTATGAAGAATATTACAATCAAACAGTCAAGCATGTTGATTTGACAGCGAACAATTTCAGCAAAGACGAGTGTAAAACAGCGTGTGAGCAGGAAAATCGTTTCGTCTGCAGGGGATTCACTTGGATAGCGTCTTCCTCTCGCGGTATATGTGATCTCCACAGTGAAGACCTGGTAACAGCTGGCTCCTGGCTCCTGAGACGAGTGTCCGGGGCATCGTATTATCGTCGCGTTATATGTCTTAACATCAGCGTGGAATGTTCACCGTCTCACTTAGTAGTGACATACAGACCTCATGGTATGTTCCGTGGGAGGGTGTACGTCCCCGGGCGGGGCGAGCGTTGCAGTGCGAGGTCATTGACACCGGCCTCACACGTCCGCCTCGCGTTGCCTCTATACGGCGATTGTGACGTCAACTTCGCATTCGCCATCTCTAAAACACCAGCAGGCATCGTTAATAGAACTATGGCGTATGTGATGCTCATGATTCAGAACAACCCGATCATACAGACAGCGGGAGATCGCTGGGTGAGAGTGGGGTGCTCGCCTGGAGACCGACAAGGGTATACTAAAGTGGACGCCACAGTCGCTGTTCAGGAGTCGGGGCGTCCGTCTGTTGCGAGCGAATCGGGCGAGGTGTCTGATAAACTGGGTGCCAGCGCTGTCCTCGGGACAACGCCACCTCTCACTATGTACGTGGTGAGAGCAACCGAAGACCAAGGGACGGGAGCCGTGGCCCTGGGAGATCTGCTTGAACTAAGGATAGAAACTACTGGAGATTCTGAAATTGAGGCGTATCATTTAGTAGCGTCTTCGAGACTTGGAGACAGTTCTGTGTTATTGTTGGACAACAGCGGATGCCCCACGGGACAGGTCGACTTCCCTTCATTCAGTCGCTCTCGTTCAGGAGTGAGTCAGCGCCTCTTCTCCCGGTTCAAGGCGTTCCGTTTTCCTACGTCTCACGTAGTTCGCTTCGCTGTCGTTGTACGATTCTGTCAAGATAAATGTGCTCCGATCAACTGTGAAATGTTGGATAGACTCAGAGACGCGAGAGGCGCGAACGAGACTTACACGACTGATTCAGAAGTAGCGGCGAGTGTTAAGGAGGAGACATCATGGCCGACGGGAGTGGTGGCGCAGGGAGGGCCGGTGATGTGTATAGGGGAGGGACAGGGTGAGGTGTTGGGGATGGAGAAGAGAGTACCCTTGGAACTGGAATTAGTGGTGGGGGCGAGAGATGTACTATCAGCGGACACACTCGTGCGCGCCGACCACAGAAGTTCTCTACCGGAGGTTGACGTCAGCAGTCCCTTGGTGTGTGTGCACGAGTTAGTGTTGGTGTCGCTGATGCTGGCGTGGCTTGCGGTGCAAATACTGCTGCTGCTGGGCTGCTGCGTCCTTGTTAAACGATACAGAAATCTAGCAGAAATGAATATGCAAAAAGACTACCATTCGTTTGACAACATTGGTTTTGACAACGTGTCAACACACAGGCGGGTTCATTGGCCGGATCAAAACATAGATATAATACATACAAATTAA

Protein sequence:

>DPOGS208550-PA
MMQRRIFVILSIFTFITSSESQCRLPESETFVRWSGMTPDQATPVFMYSASGEEESLTGACLSRCRELADCAAVIIVYTKGSCQGISSSRTNQLRPDNEVAYFNKVCLKLPDNCKKLWWALESTPGYYLNSDGPDVKVVVNSTVQDCYNAMFSTNEKRYRSAQWIEPGSPLDSYFVAQKEIGNCILNGENKFTEPESYRVSNSYTFYIENQCSHDYPKKIDRCSYEEYYNQTVKHVDLTANNFSKDECKTACEQENRFVCRGFTWIASSSRGICDLHSEDLVTAGSWLLRRVSGASYYRRVICLNISVECSPSHLVVTYRPHGMFRGRVYVPGRGERCSARSLTPASHVRLALPLYGDCDVNFAFAISKTPAGIVNRTMAYVMLMIQNNPIIQTAGDRWVRVGCSPGDRQGYTKVDATVAVQESGRPSVASESGEVSDKLGASAVLGTTPPLTMYVVRATEDQGTGAVALGDLLELRIETTGDSEIEAYHLVASSRLGDSSVLLLDNSGCPTGQVDFPSFSRSRSGVSQRLFSRFKAFRFPTSHVVRFAVVVRFCQDKCAPINCEMLDRLRDARGANETYTTDSEVAASVKEETSWPTGVVAQGGPVMCIGEGQGEVLGMEKRVPLELELVVGARDVLSADTLVRADHRSSLPEVDVSSPLVCVHELVLVSLMLAWLAVQILLLLGCCVLVKRYRNLAEMNMQKDYHSFDNIGFDNVSTHRRVHWPDQNIDIIHTN-