Monarch geneset OGS2.0

DPOGS215338
TranscriptDPOGS215338-TA1728 bp
ProteinDPOGS215338-PA575 aa
Genomic positionDPSCF300120 + 398028-399824
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0118620.071.70% 
BombyxBGIBMGA007972-TA0.065.57% 
DrosophilaCG10347-PB2e-5126.04% 
EBI UniRef50UniRef50_D6WDT83e-10437.85%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WDT8_TRICA
NCBI RefSeqXP_967066.16e-10537.85%PREDICTED: similar to NudC domain containing 1 [Tribolium castaneum]
NCBI nr blastpgi|910794101e-10337.85%PREDICTED: similar to NudC domain containing 1 [Tribolium castaneum]
NCBI nr blastxgi|910794105e-10337.85%PREDICTED: similar to NudC domain containing 1 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[251-359] IPR0089787.5e-18HSP20-like chaperone
[272-344] IPR0174471.4e-08CS domain
Orthology groupMCL15089 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215338-TA
ATGCCGTCTGTACTTGTTGAGTTGAGACCTAATAAAAAGCTTCTCGATCATGAATTTGAAGGATACAAAATAAATCTACAAAGTCTAGCTCAATATAATTTGAAACTAACCACTACAGCCGACCGGCTCTATCCCGATGAAGTGCAATACTCCTTCGTACATGCGAAATTATTTGCTTTGCACAATCACCTAATATTTGATGTATGGGATCATCAATACAACTTTTACTATATTGATAATCAGCAGCAAGTTATTAACGTCACATTTGATAATATAAACAATTCATTCCAAACGGCTGTAGTATATGATATCCCACAACATGTTGAAAGAAAACCTGGCCATTTCAACCTTTGTTTAACATTTCCTGCAAACTATTTAGCAATAATCAGTGATGGTACTGGTTTCCTACACATAGTTGATACCGGGACAAGAAACAGACCTGCCACTGATAAACAACCATGGAACACAATACATTCAAGTCTAGCTCTCGGTGAGGGTAAATATTTTACTGTTGTTGATAGTAGAATGCAAAAAAAGTGTAATGTTGACATATTACATTGTCTGTTACAATCTGTTGAACAGAATGACAATCATTTTGATACAGTTTTGACCTGGCTTTCGTTTGAATTTATTGATCATAAATGGAAACAGTTTGCAAAGAGGGAGGTCAAAGGTAAAGGTATAGTACACTATGCAGCATTTGAAACCCAATGTCAAGCTTTGTACATAGCATCTGACAATATGTTTAAATTTACTTCTGATTCTGTTAAAAATATATTACCACCACAACAACCAGAACAGAAACCAATAATATACACATGGTTGCAGACGGCTGATGATATCACAGTTACTCTGAAATTGGAAGATAATTTTGACAAAAAATTGCTCTTTGTACATGTAACGCCATTATCAATTAAAATACGTTATGCAGGAAAAGAGTTTGTGAGTGGTAAACTGAAAAATAAAGTTGACAGTGAATTGACAACTTGGAATATACAGGATAACGGGCAAGTAGATGTTTTGATAACAAAGTCTGAAAGTGACATGTGGAACGAACTTATTGAGGGAGGGGACCAGAATGGAAAAGAAATATTGGATGCTTCACTTGTTGAAGAGATCCACCAAAGATTGGCTCATTTATGTTCAGAAACAGAGGTTATGTCCGACCAGCCTCTAACAAGCCTCGCTTCCCAGGAGTTAGAGGAATGTGATGCTGCATCTGAGGAAGACACAGTCTTAGCACGGTTAGATACAACAACCCATGAAATAACCCACAGGATACCCCTCAGTGTGCATCATTATTTATTCAGCATTAACTTAGAAACACATGAAGCACCGGCCCTGGTCTTGAGACATGATGTGGATGGATGTGTGTGGCAGCCCTTCACACAACCTTTCAACTCAGACTCTTGGCCGATCAAACACTACGGTACTCTAATGGCCTTTGGATATGTGCAGTCTTCAAAGACAAATCGTAAATTTGTTACATGTGCACCAAACTTCTCATATAGTGTTGTGTGTGAAGCCAAGAAACACATATTCATATACAAATGTGCAGCGGAAGAGACTCAGTTGAGACGGAGGACAGCTGGAGTCATGAAGACAATTAAAGTGGGACAACAACATGTCATTAATATTGACATGTTCGGAGAGGTGTTAGGAGTGCATGCTACAAATGAATACTTGTTTGTCTTAACAGAGAAAAATCTTATAGCGATATGTATTTAA

Protein sequence:

>DPOGS215338-PA
MPSVLVELRPNKKLLDHEFEGYKINLQSLAQYNLKLTTTADRLYPDEVQYSFVHAKLFALHNHLIFDVWDHQYNFYYIDNQQQVINVTFDNINNSFQTAVVYDIPQHVERKPGHFNLCLTFPANYLAIISDGTGFLHIVDTGTRNRPATDKQPWNTIHSSLALGEGKYFTVVDSRMQKKCNVDILHCLLQSVEQNDNHFDTVLTWLSFEFIDHKWKQFAKREVKGKGIVHYAAFETQCQALYIASDNMFKFTSDSVKNILPPQQPEQKPIIYTWLQTADDITVTLKLEDNFDKKLLFVHVTPLSIKIRYAGKEFVSGKLKNKVDSELTTWNIQDNGQVDVLITKSESDMWNELIEGGDQNGKEILDASLVEEIHQRLAHLCSETEVMSDQPLTSLASQELEECDAASEEDTVLARLDTTTHEITHRIPLSVHHYLFSINLETHEAPALVLRHDVDGCVWQPFTQPFNSDSWPIKHYGTLMAFGYVQSSKTNRKFVTCAPNFSYSVVCEAKKHIFIYKCAAEETQLRRRTAGVMKTIKVGQQHVINIDMFGEVLGVHATNEYLFVLTEKNLIAICI-