Monarch geneset OGS2.0

DPOGS204067
TranscriptDPOGS204067-TA1131 bp
ProteinDPOGS204067-PA376 aa
Genomic positionDPSCF300200 + 13202-15246
RNAseq coverage116x (Rank: top 58%)
Annotation
HeliconiusHMEL0131293e-11461.80% 
BombyxBGIBMGA010807-TA8e-12763.12% 
DrosophilaCG16719-PA3e-2230.56% 
EBI UniRef50UniRef50_D6X1I27e-3538.94%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X1I2_TRICA
NCBI RefSeqXP_969808.11e-3538.94%PREDICTED: similar to calponin-homology and microtubule-associated protein [Tribolium castaneum]
NCBI nr blastpgi|910931002e-3438.94%PREDICTED: similar to calponin-homology and microtubule-associated protein [Tribolium castaneum]
NCBI nr blastxgi|910931002e-3539.37%PREDICTED: similar to calponin-homology and microtubule-associated protein [Tribolium castaneum]
Group
Gene OntologyGO:00055151.1e-06protein binding
KEGG pathway 
InterPro domain[12-217] IPR0104414.8e-48Protein of unknown function DUF1042
[9-117] IPR0017151.1e-06Calponin homology domain
Orthology groupMCL12534 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204067-TA
ATGTCTGCCTACGATTCACCGATACCACTGTCTGATATTGAGACCGTCCTGGCCTGGGTCGACACATTCAAGCTGTCAAGGCCTACGAAGAAAATAAACAGAGATTTCTCTGATGCTGTACTTCTAGCGGAAATCTTAAGCGTGCATTATCCTAAGCTGGTGGAGATGCACAACTACCCGCCCAGGAATAGTCATTCCCTTAAATTGAACAATTGGATGACTCTGAACAGGAAGGTGCTTAAGAAGCTGCGTCTGAACCTGTGCAGCAACACCATGGAACAGCTCGCGAACTGCGCTCCTGGGGTCATCGAACGTGTTCTGGTTATGGTCCGAGACAAAATCCGTCGAGATGAGGACGCTAATAGGTCGCTGAAAGAGTCTGAACAAAATGTTTCCAGTGGCGGAAGTTACTACGAGGCCTGCGGAGACGAGGAACATGTATTGGTAGTTCCTGTTAAGATGAGAGTGAATGGAGTTTTAGAAACAATCCAGCAGAAAGTTGTCAACTACGAAACACATCTAACGCTTAAAGAAGAACTGAAAGATGCAAAGGAAACTGTGGAAACGCTGAAACAAAAGGTCGATCACTTGGACAAGTTGCTGAATCTTAAAGAGGAGAGAATAGAAGAATTGCAGAAACAATTAGAGCGAAAACAGACAAAGCGTAAAGAGGTTGAGGCCCTTCAGAATAGTTTAGCTGTGCCCTTTGAACCTGAACCGCCAACACCCAAAATTTTTGATCTGATACCTGTACCCAGCAGGACGGTTTCAGCGAAATCTTTAGAATCTTCCAGTAAAGTGTCCATCGAATCTAACAAATTACCAAATCAAGAACTCGAAGCGATCTCTAAGGATGAAACTAAAATTCCCATACCAAAACTCGTCATCCCTAAATGCAAATCCAAACCGAGTGTTACAGAAGTGCTTGATTTCAAGCATCAGAATTCCGAGGAAAAAATTGATGTGGACAGAATCAAATCCGACGTATTCAAGGAAATCGAAATAACTGATGATAATAATAATGATATGAGGATGCCCTCGAGTGTCCAGGAATTTATCACAATAGAAAATAGCATCAGGCAGGAGGTTCAGGAGATATGTGATGCGGTACATTTACGTGACACGACTTAG

Protein sequence:

>DPOGS204067-PA
MSAYDSPIPLSDIETVLAWVDTFKLSRPTKKINRDFSDAVLLAEILSVHYPKLVEMHNYPPRNSHSLKLNNWMTLNRKVLKKLRLNLCSNTMEQLANCAPGVIERVLVMVRDKIRRDEDANRSLKESEQNVSSGGSYYEACGDEEHVLVVPVKMRVNGVLETIQQKVVNYETHLTLKEELKDAKETVETLKQKVDHLDKLLNLKEERIEELQKQLERKQTKRKEVEALQNSLAVPFEPEPPTPKIFDLIPVPSRTVSAKSLESSSKVSIESNKLPNQELEAISKDETKIPIPKLVIPKCKSKPSVTEVLDFKHQNSEEKIDVDRIKSDVFKEIEITDDNNNDMRMPSSVQEFITIENSIRQEVQEICDAVHLRDTT-