Monarch geneset OGS2.0

DPOGS216038
TranscriptDPOGS216038-TA1707 bp
ProteinDPOGS216038-PA568 aa
Genomic positionDPSCF300067 - 337511-344857
RNAseq coverage788x (Rank: top 16%)
Annotation
HeliconiusHMEL0089290.074.82% 
BombyxBGIBMGA009018-TA0.082.60% 
DrosophilaCG11208-PA0.065.27% 
EBI UniRef50UniRef50_Q9UJ836e-18053.44%2-hydroxyacyl-CoA lyase 1 n=60 Tax=Eumetazoa RepID=HACL1_HUMAN
NCBI RefSeqNP_001040193.10.082.60%2-hydroxyphytanoyl-CoA lyase [Bombyx mori]
NCBI nr blastpgi|1140519140.082.60%2-hydroxyphytanoyl-CoA lyase [Bombyx mori]
NCBI nr blastxgi|1140519140.082.60%2-hydroxyphytanoyl-CoA lyase [Bombyx mori]
Group
Gene OntologyGO:00309764.7e-48thiamine pyrophosphate binding
GO:00002872.3e-32magnesium ion binding
GO:00038242.1e-24catalytic activity
KEGG pathwaydwi:Dwil_GK156800.0 
 K12261 (HACL1)maps-> Peroxisome
InterPro domain[5-171] IPR0120014.7e-48Thiamine pyrophosphate enzyme, N-terminal TPP-binding domain
[193-322] IPR0120002.3e-32Thiamine pyrophosphate enzyme, central domain
[389-547] IPR0117662.1e-24Thiamine pyrophosphate enzyme, C-terminal TPP-binding
Orthology groupMCL11921 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216038-TA
ATGGGGATCGACGGGAACAATATACTCGCTGAAAGCTTGAAGCGACAAGGCATTGAATATGTTTTTGGGATTGTTGGTATTCCTGTAATAGAGACTTCATTAGCTTTTCAAGCTGCAGGTCTCAAGTACATTGGGATGCGAAATGAACAGGCAGCCTGTTATGCTGCTCAAGCTATTGGCTATTTAACAGGTAAACCAGGAGTATGTCTGGTTGTATCTGGCCCTGGTCTCTTACATTGTATTGGAGGTATGGCCAATGCTCAAGTTAACTGTTGGCCGCTGTTAGTCATAGCCGGATCTTGCCCGGAAGACCATGAAGGCATTGGCGGTTTCCAGGAATGGCTGCAGGTGGAGTCATCTCGTCAGTATAGTAAATATGCAGCCCGACCACCTTCCCCACGACTTATTCCACTACATGTAGAAAAAGCAATCAGATATGCCAGCTCCGGACGTCCAGGTGTCGCTTATCTTGATATGCCTGCTACCTTATTGACGGCTGAAGCTGATGAAGATAAGGTTCCTTTAGACTACTACTCAGCGGATCCAGTTAGTTTGGCTCACCCAAATCCAGTACTGGTAAATGAAGCAGCTGACCTATTGTCCAAGGCTGAAAGACCCCTCATCATAGTTGGCAAAGGAGCTGCTTATGGAAAAGCAGAAGAGGCTATCACCAAGCTTGTGGAGAATATTAAAGTACCATTCCTACCGACTCCTATGGGCAAAGGAGTGGTCCCAGACGAGTCTCAATACTGTGTGTCAACCGCTCGCACTCAGGCGCTACTTGGGGCTGACGTCATACTGCTGTTGGGGGCAAGAATGAATTGGATGATGCATTTCGGACAGGTCCCGAGATATGCAGCCAATGTTAAGATTATTCAAGTGGATATAGCTCCCGAAGAATTCCACAACAGTGTCAAATCAGAAGTGGCCGTCCATTCAGATATCAAACCGTTTGTGGAAGCGCTCACAAATAAACTAGCGGAGAAAAAGTTTTCATTACAAAATAATAGTCCCTGGTGGCAAGCATTGAAGGAAAAACAAAAGAAAAACACAGAATTTGTTAAGGCACAAGCAGCCGATAAATCCCTGCCACTGAATTACTATGCAGTTTTTAAAGCTGTTCAAGAAAATATCCCAAAGGATTCAATAATAGTGAGTGAAGGCGCTAACACTATGGACATCGGCCGCGGGATATTACTCAATAACAAACCGAGGCATCGTCTGGACGCGGGAACATTTGGCACTATGGGGGTCGGCCCCGGGTTCGCCGTAGCTGCGGCGCAGTGGTGCCGTGACCACGCTCCAGATAAACGAGTGATTTGTGTTGAAGGAGATTCTGCGTTTGGTTTCTCAGGTATGGAAATTGAGACAATGTTCCGCTACAAGTTGCCAGTGATTATTGTGATTGTGAACAACAACGGCATTTACAGCGGCTTCGACAAAGAAATGATGACGGAGATACAAAACTCCGGCGATCTTGCCCAGTGTACTCCACCCACAGCACTGTCAACGGAAGTGAGATATGAAAAAATGATGGAAATGTTTGGATCAAGCGGCCATTTCTGTCGTACAGTTGAAGAAATCGAAAATGCCTTGAAATCAGCTATTAAAGTAACCGACAGACCCAGTATTATAAATATTGCCATTAACCCACAATCCAACAGAAAACCTCAAACATTCAACTGGCTGACTGAATCAAAACTATAA

Protein sequence:

>DPOGS216038-PA
MGIDGNNILAESLKRQGIEYVFGIVGIPVIETSLAFQAAGLKYIGMRNEQAACYAAQAIGYLTGKPGVCLVVSGPGLLHCIGGMANAQVNCWPLLVIAGSCPEDHEGIGGFQEWLQVESSRQYSKYAARPPSPRLIPLHVEKAIRYASSGRPGVAYLDMPATLLTAEADEDKVPLDYYSADPVSLAHPNPVLVNEAADLLSKAERPLIIVGKGAAYGKAEEAITKLVENIKVPFLPTPMGKGVVPDESQYCVSTARTQALLGADVILLLGARMNWMMHFGQVPRYAANVKIIQVDIAPEEFHNSVKSEVAVHSDIKPFVEALTNKLAEKKFSLQNNSPWWQALKEKQKKNTEFVKAQAADKSLPLNYYAVFKAVQENIPKDSIIVSEGANTMDIGRGILLNNKPRHRLDAGTFGTMGVGPGFAVAAAQWCRDHAPDKRVICVEGDSAFGFSGMEIETMFRYKLPVIIVIVNNNGIYSGFDKEMMTEIQNSGDLAQCTPPTALSTEVRYEKMMEMFGSSGHFCRTVEEIENALKSAIKVTDRPSIINIAINPQSNRKPQTFNWLTESKL-