Monarch geneset OGS2.0

DPOGS206235
Transcript        DPOGS206235-TA  2541 bp
Protein           DPOGS206235-PA  846 aa
Genomic position  DPSCF300334 + 123387-133462
RNAseq coverage   57x (Rank: top 69%)
Annotation
Heliconius      HMEL007928        2e-103  50.29%
Bombyx          BGIBMGA009691-TA  0.0     88.32%
Drosophila      Hdc-PA            8e-146  67.63%
EBI UniRef50    UniRef50_Q05733   1e-143  67.63%  Histidine decarboxylase n=52 Tax=Coelomata RepID=DCHS_DROME
NCBI RefSeq     XP_002089916.1    4e-145  67.92%  GE19346 [Drosophila yakuba]
NCBI nr blastp  gi|195475288      8e-144  67.92%  GE19346 [Drosophila yakuba]
NCBI nr blastx  gi|269316843      7e-138  66.47%  putative glutamate decarboxylase [Eumenes pomiformis]
Group
Gene Ontology    GO:0019752  4.3e-231  carboxylic acid metabolic process
                 GO:0016831  4.3e-231  carboxy-lyase activity
                 GO:0030170  4.3e-231  pyridoxal phosphate binding
                 GO:0003824  4.1e-93   catalytic activity
                 GO:0006520  1.4e-58   cellular amino acid metabolic process
KEGG pathway     dya:Dyak_GE19346  1e-144
                 K01590 (E4.1.1.22, HDC) maps-> Histidine metabolism
InterPro domain  [1-601]    IPR002129  4.3e-231  Pyridoxal phosphate-dependent decarboxylase
                 [1-357]    IPR015424  9.5e-106  Pyridoxal phosphate-dependent transferase, major domain
                 [84-345]   IPR015421  4.1e-93   Pyridoxal phosphate-dependent transferase, major region, subdomain 1
                 [6-25]     IPR010977  1.4e-58   Aromatic-L-amino-acid decarboxylase
                 [458-553]  IPR015422  4.1e-32   Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology group  MCL15925  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206235-TA
ATGGACCACAAGGAGTTCAGAGTGAAAGCGAAAGAGCTGGTGGACTATATGGCGGATTACCTGGAGAACATCAGAGACCACAAGGTGTACCCTGGCGTACAACCAGGGTACCTCCACAAACGACTCCCGGACCACGCGCCGGAGATGCCGGAGAAATGGGACGACATCTTCAAGGACGTCGAAGATCACATCATGCCCGGGATCGTCCACTGGCAGAGTCCCCACATGCACGCCTACTTCCCAGCGTTGACCTCTTACCCATCAATCATGGGAGAAATGCTTTCCAGTGCTATGAACGTTCTCTGCTTCACCTGGGCCTCGTCGCCAGCTGGTACAGAATTGGAGACGATCGCGATGAACTGGCTCGGGAAGCTCCTCGGTCTGCCGGATTGCTTCCTCAACGAAAAGAACGACAGCCAAGGAGGAGGTGTGATACAGACTACAGCAAGCGAGGCAACCCTGGTGAGTCTGCTGGCTGCTCGCACCAGGGCTCTGATGGAACTATCGGCTCTCAACCCTGACATGCAGTCTTCTGAACTGCTCGGACATTTGATAGCGTACTGCTCGGACCAAGCACATTCCTCAGTGGAGAAGGCTGGACTCATTGGTCTGGTGCGGATGCGTTACATAGAGTCGGATGAGCACCAGTGCATGAGAGGTGACAAGTTAGAGGAAGCCATCATCAACGACAAAGCCAAGGGACTGGTCCCGTTTTGGGTTTGCGCCACTCTCGGTACGACAGGGTCCGTAGCCTTCGACGATCTCCGGGAGATAGGCCCGGTGTGTGACCAGCACTCCATCTGGCTGCACGTAGATGCTGCATATGCTGGGAGCGCTTTCATATGCCCCGAATACAGACACTGGCTGGATGGGATCGAGCTGGTGGATTCCTTCGCCTTCAATCCATCCAAGTGGCTGATGGTCAACTTCGACTGCACCGGCATGTGGGTCAGGGACAGTAACGCTCTGCACAGAACCTTCAACGTGAACCCTATTTATCTAAGACACGAAAATTCAGGCAAAGCTATCGACTACATGGTTGGTTCTCAACACGCGAACACCAGGCGGAGAATCGAAACCGAGATCCTCGTCAAAGGTGATGAGCATGACGACAGTGTTATAAGTGACGAGGGTTACCCGCCCAGTGAGGAGAGTAGTCTCACTGACGACAGTGACGAGGATATCAAACCACGGGACGCGCCAGTAGCTGATAAGATACAGTTGGAAAACCCCGATAAGAAAGAGCCGCTCGTATGTCTTAGCTTACGGGTGTCGGTCCTCGCGGTGTCGGAGAGCTTTAAAGCTGTGGTTCGTGTTGAGGAACTATGGAGTGAGCGGCCTCCAGAAACACATAAGAGAGGTCAGTGGAGCGTCCGTCTAGCTCAGAAGTTTGAGGCCTTGGTGCTAGCTGATCAACGTTTTGAAATACCACAACCACGGAATCTGGGCATGGTTGCTTTCCGTCTCAAGGGAGACAACACCCTCACCGAGTACCTTCTGAAGCGCCTGAACGCCCGCGGCTACCTACACGCCGTGCCGGCATGTTTCAAGGGCGTCTACGTCATCAGGTTCACCGTCACCTCCCAAAGAACCACCAACCAGGACATACTCGACGACTGGACAGAGATTAAGACGGTGGCGTCCGAAATACTGAAGGAAATGTTTGGATCAGAAAACGGAAACATCGTGGTGTCCAAGAAACCGAGGATTTCTTTGAAAGAAACTCGCGAACTGAACGCTACGTTTGGGACGAGCCTGTTGCTAGCCAACAGTCCGATGAGTCCAAAGATCGTAAACGGCACTCACGCAGCGATATGCGATTATGAGTCACTGCTCTCATCCTGCGCTCAGACCTTCGCTGAGCTGAAAATGGAACCCAAAGATAGTCCTGAAATGCGGCGTCGCGTCCGCGGTATGAAGGCGTGCGGGAAGAAGTTCTCGCTGGACTCCTACATGGACATGCTCCAGGAGCTGGTGGTGGCGTCCCTGCCGCAGTGCTC
GGAGGAGAAGGAGGAGACTCCCAACGGATCCAGCCCCGCCGACCGTTCCATCTCATCACCTGTGGTATCCAGCACTACCGTAAAACCAGTCGCCTGCACGGACACCAACCAATTACTAGTACCAATGACTCCATCCAGACAGTTCAGGTCAAAATCGGTCGACGAAACCGATTTGAAGCTAGACGACGCGGTCATCTCCGTAGACATCAAAAACAACGAGATCACGCTCACGCCGACCGATTCTAAAAGCATTCTCGACGCTCGCGACGTATCCGAGCTCAAAATCGGCGATCGGATCTCGAGGGCATTTGACCTCATGGACACTAACAATATGGAATGTAAGGAAGCGGGCGAAGCCAAGTTGACCATCAAAGGACCCGGGAGCTACATTAAGCAGATCATACAGCAGTTCAGCGAGGGACCCTTCGACGCGGAGGACTGCAAACCTGACCCAGGACGAGCTGTCGCCACACAGAGCCTCAAACAGCGGGCTGACGCGTTTTGCAAGAAATGCCTTCACTACAAAGGCGTCAACAAGTAA

Protein sequence:

>DPOGS206235-PA
MDHKEFRVKAKELVDYMADYLENIRDHKVYPGVQPGYLHKRLPDHAPEMPEKWDDIFKDVEDHIMPGIVHWQSPHMHAYFPALTSYPSIMGEMLSSAMNVLCFTWASSPAGTELETIAMNWLGKLLGLPDCFLNEKNDSQGGGVIQTTASEATLVSLLAARTRALMELSALNPDMQSSELLGHLIAYCSDQAHSSVEKAGLIGLVRMRYIESDEHQCMRGDKLEEAIINDKAKGLVPFWVCATLGTTGSVAFDDLREIGPVCDQHSIWLHVDAAYAGSAFICPEYRHWLDGIELVDSFAFNPSKWLMVNFDCTGMWVRDSNALHRTFNVNPIYLRHENSGKAIDYMVGSQHANTRRRIETEILVKGDEHDDSVISDEGYPPSEESSLTDDSDEDIKPRDAPVADKIQLENPDKKEPLVCLSLRVSVLAVSESFKAVVRVEELWSERPPETHKRGQWSVRLAQKFEALVLADQRFEIPQPRNLGMVAFRLKGDNTLTEYLLKRLNARGYLHAVPACFKGVYVIRFTVTSQRTTNQDILDDWTEIKTVASEILKEMFGSENGNIVVSKKPRISLKETRELNATFGTSLLLANSPMSPKIVNGTHAAICDYESLLSSCAQTFAELKMEPKDSPEMRRRVRGMKACGKKFSLDSYMDMLQELVVASLPQCSEEKEETPNGSSPADRSISSPVVSSTTVKPVACTDTNQLLVPMTPSRQFRSKSVDETDLKLDDAVISVDIKNNEITLTPTDSKSILDARDVSELKIGDRISRAFDLMDTNNMECKEAGEAKLTIKGPGSYIKQIIQQFSEGPFDAEDCKPDPGRAVATQSLKQRADAFCKKCLHYKGVNK-
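
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code and assuming that the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this illustration. It translates the first and last codons of DPOGS206235-TA and confirms that 2541 bp corresponds to 846 codons plus one stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are removed first, so a sequence that is
    wrapped over several lines can be passed in directly. Stop codons are
    written as `stop_symbol` (assumed here to match the "-" that ends the
    protein record).
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last five codons of DPOGS206235-TA, copied from above.
first_codons = "ATGGACCACAAGGAGTTCAGA"
last_codons = "GGCGTCAACAAGTAA"

print(translate(first_codons))  # MDHKEFR
print(translate(last_codons))   # GVNK-

# Length consistency: 846 residues plus one stop codon, three bases each.
print((846 + 1) * 3 == 2541)    # True
```

Passing the full nucleotide sequence to `translate` in the same way should give a string that can be compared directly with the DPOGS206235-PA record.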