Monarch geneset OGS2.0

DPOGS215754
TranscriptDPOGS215754-TA957 bp
ProteinDPOGS215754-PA318 aa
Genomic positionDPSCF300041 + 1269413-1274272
RNAseq coverage604x (Rank: top 21%)
Annotation
HeliconiusHMEL0141399e-17288.05% 
BombyxBGIBMGA003631-TA2e-15279.49% 
DrosophilaCG3609-PB9e-7240.19% 
EBI UniRef50UniRef50_E2AXQ42e-7945.91%Trans-1,2-dihydrobenzene-1,2-diol dehydrogenase n=8 Tax=Formicidae RepID=E2AXQ4_CAMFO
NCBI RefSeqXP_395693.21e-8146.67%PREDICTED: similar to CG3609-PA [Apis mellifera]
NCBI nr blastpgi|3228028501e-8148.42%hypothetical protein SINV_04686 [Solenopsis invicta]
NCBI nr blastxgi|3228028501e-7948.42%hypothetical protein SINV_04686 [Solenopsis invicta]
Group
Gene OntologyGO:00054882.7e-35binding
GO:00164913.8e-19oxidoreductase activity
KEGG pathwaycin:1001860382e-65 
 K00078 (DHDH)maps-> Pentose and glucuronate interconversions
    Metabolism of xenobiotics by cytochrome P450
InterPro domain[2-134] IPR0160402.7e-35NAD(P)-binding domain
[14-109] IPR0006833.8e-19Oxidoreductase, N-terminal
Orthology groupMCL11088 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215754-TA
ATGATCGCTCATGACTTCCTAACAGCGATGGCGTCTCTGTCGCCTGAACAGCACAAGGTTACAGCTGTGGCTGGTAAAGACTTGGATCGTGTTTACAGATTGGCAACGCTCCATAATATTAGCACTGCCTATGAAGGCTACGATGCGATCGCAGCGGACAACACAATTGACATTGTTTTCGTCAGCGTTTTAAACTTGCAACATTACGAAGTAACAAAGCTAATGTTAGAGAATGGTAAACACGTTTTATGTGAGAAACCTATGGGCATGACATATAAACAAACGAAATCCCTAGTGGATTTAGCGCGGGAAAAAAAACTGTTTCTTCTCGAAGGGATGTGGTCTAGGTTCTTCCCGGCTTACGACGCTTTGGACAAGCACATCACTAGCGGCGGCTTGGGTGATATATATCATATAAATGTGCAATTCGGTGTGGAAATTAATGATATTGAAAGGAATCTGATGAAAGATTTGGGCGGCGGCGTTGTGTTGGATTTAGGTATATACATGTTACAATTGCTGCAATTCATATACAAGGAACCGCCGACAGATATTATGTGTACCGGGCATTTGAGTAAGACCGGGGTTGACGAATCAATGTCCTGCGCATTAAAATACAAAGACGGTAGAACGGCTACCATCGCGGCGCATACGAGGGCAGCGCTCGCTAACAGAGCGGAAATAAATGGTACTAAAGGAACCATACATCTGGACTATTTTTGGTGTCCAACTGTTTTGCACATATCAGCAGCAAATAGTACCGAATGGACGTTGCCGAAGGGCAAATACAAGTTTCATTTCCACAACAGCGCTGGACTTGCTTACCAGATACAAGAATGTTGGGATTGTATTAACAACGGTTTGCTCGAGAGTCCGAAGATGAATTTAGACGAGAGTGTACTTTTATCTAAACTCTGTGACACAATGAGAACACAACTGGGAGTGCTCGACTCCTAG

Protein sequence:

>DPOGS215754-PA
MIAHDFLTAMASLSPEQHKVTAVAGKDLDRVYRLATLHNISTAYEGYDAIAADNTIDIVFVSVLNLQHYEVTKLMLENGKHVLCEKPMGMTYKQTKSLVDLAREKKLFLLEGMWSRFFPAYDALDKHITSGGLGDIYHINVQFGVEINDIERNLMKDLGGGVVLDLGIYMLQLLQFIYKEPPTDIMCTGHLSKTGVDESMSCALKYKDGRTATIAAHTRAALANRAEINGTKGTIHLDYFWCPTVLHISAANSTEWTLPKGKYKFHFHNSAGLAYQIQECWDCINNGLLESPKMNLDESVLLSKLCDTMRTQLGVLDS-