Monarch geneset OGS2.0

DPOGS205572
TranscriptDPOGS205572-TA1356 bp
ProteinDPOGS205572-PA451 aa
Genomic positionDPSCF300099 + 544255-554338
RNAseq coverage43x (Rank: top 72%)
Annotation
HeliconiusHMEL0060210.092.96% 
BombyxBGIBMGA008045-TA2e-10150.00% 
DrosophilaCG32052-PB7e-9742.70% 
EBI UniRef50UniRef50_E0VZL83e-9744.42%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VZL8_PEDHC
NCBI RefSeqXP_315822.42e-10744.57%AGAP005806-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3838597322e-10043.86%PREDICTED: acid sphingomyelinase-like phosphodiesterase 3a-like [Megachile rotundata]
NCBI nr blastxgi|910918766e-10247.30%PREDICTED: similar to AGAP005806-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL10824 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205572-TA
ATGACTCCAAGCAGCTTTACTCTTTACATCACCAGCCTGCTGTGGCTTGATCTGACACACGCTAAAATTGGTTACTTCTGGCACATCACAGATCTTCATTATGACCCGTTCCACAACTCCCTCGAGTTACATAGAGGTTGCAAGCACAGTCGTGGCAGCAATGAGCACAGCCACCGCAATGGCCGCCACCGTGACCACACGTGTGACAGCTCGTGGCCACTCATACAAAGCGCTGCCACATTCATGGCTGAACGACATCCTGATACGTTAGAATTTGTACTATGGACCGGGGACATCTTGTCGTCATCTATGGAACATCTGAGTGATGAAATCAAACTAGAGGCGGTCAGAAACGTAACTGACATACTAAGTCGTACATTTAGTAGCCAGTTTGTATTTCCAGCATTGGGTCATAACGATCCTCCACCATCCAGGAAGTTAGTAGACATGTGGATGCAGTGGCTGCCCACCGAGGCTTTACAGACTTTTGAGACAGGTGGTTATTATACCATAGAACAGTCTCACAGCAAACTCCGAATTGTCGTTCTTAATAGCGTGCTTTGGGCTGGAGGTGCGGCCAAGACTGACGGTCCTCACAGAGGACGAGCGCAGTGGGAGTGGCTTGAACACGTACTTAGTAAGGCAAGAAGGAAAAACGAAATGGTATACTTAGTAGCTCATGCTGGGCTAGGTGTTGAAGAACGTCATAACGCTGGAAGCTCGTCGGCGTCAGGCGGCGGAGAGCTTACTCCGACTGCTAACGCGAGATTATTGCATGTCATCAGAGCATTCAGTGACGTCATCGCCGGCCAGTTCTATGGACATCGGCACGCAGATACATTCAGACTTGTTTATAGTGAAGGGCGACCTGTATCGTGGGCGCTTTTAGCACCGTCGCTCACTCCACGAGGTGCGGGCAGTGTTTCGAATCCAGGATTAAGGCTTTATAAATTTGAGTCTAATACAGGAAAAGTATTAGATTATACACAGTATTATCTAGATGTAACTAATACAAGAGGTGAAGCGCATTGGGCCGTTGAATACAATGTCACACAATATTATGGTCTAAGAGAAGTAAGCGCAGTGTCATTGGACAATTTGGCTGAGAAGATTAGAAATTATCCAGATAGAACCTATCTTAACAAGTATTTATCAGCTCTGCGGGTGCGACACGCCACTGATGTATCTGAGTGTGACACTGCATGTATTCACGTGCATTTCTGTGCGATAACTCGAGCTGATTTCCATGAATTTCGCTCATGTGTCCGCAATCCTGCGTCGGCGCTAGCATCGCGCGCGCCTCATACTTCTGCTGCAATTTTGGTTTACGCCATGATATTTTTGCTATCGTCTTAA

Protein sequence:

>DPOGS205572-PA
MTPSSFTLYITSLLWLDLTHAKIGYFWHITDLHYDPFHNSLELHRGCKHSRGSNEHSHRNGRHRDHTCDSSWPLIQSAATFMAERHPDTLEFVLWTGDILSSSMEHLSDEIKLEAVRNVTDILSRTFSSQFVFPALGHNDPPPSRKLVDMWMQWLPTEALQTFETGGYYTIEQSHSKLRIVVLNSVLWAGGAAKTDGPHRGRAQWEWLEHVLSKARRKNEMVYLVAHAGLGVEERHNAGSSSASGGGELTPTANARLLHVIRAFSDVIAGQFYGHRHADTFRLVYSEGRPVSWALLAPSLTPRGAGSVSNPGLRLYKFESNTGKVLDYTQYYLDVTNTRGEAHWAVEYNVTQYYGLREVSAVSLDNLAEKIRNYPDRTYLNKYLSALRVRHATDVSECDTACIHVHFCAITRADFHEFRSCVRNPASALASRAPHTSAAILVYAMIFLLSS-