Monarch geneset OGS2.0

DPOGS206668
TranscriptDPOGS206668-TA1020 bp
ProteinDPOGS206668-PA339 aa
Genomic positionDPSCF300048 + 543025-545493
RNAseq coverage339x (Rank: top 34%)
Annotation
HeliconiusHMEL0111502e-17490.35% 
BombyxBGIBMGA008499-TA2e-10485.84% 
DrosophilaCG13377-PA7e-3736.25% 
EBI UniRef50UniRef50_Q7QA374e-3935.80%AGAP004450-PA n=3 Tax=Anopheles RepID=Q7QA37_ANOGA
NCBI RefSeqXP_313742.47e-4035.80%AGAP004450-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582921841e-3835.80%AGAP004450-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1259829943e-3737.06%GA12240 [Drosophila pseudoobscura pseudoobscura]
Group
Gene OntologyGO:00054883.3e-18binding
GO:00081521.6e-06metabolic process
GO:00164911.6e-06oxidoreductase activity
KEGG pathwaydme:Dmel_CG133775e-35 
 K00019 (E1.1.1.30, bdh)maps-> Butanoate metabolism
    Synthesis and degradation of ketone bodies
InterPro domain[43-241] IPR0160403.3e-18NAD(P)-binding domain
[43-172] IPR0021981.6e-06Short-chain dehydrogenase/reductase SDR
Orthology groupMCL12899 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206668-TA
ATGGACCCGTTGACGTGGTTATCCCTTGGCCTGCAACTGGCGGCGTTGTGCAGCATAGTTGGAGCATTGCTGCTCTATCTATTGAGGAAGGTTCGCGTTGCTGAAGTGCTGCCAGTGGACAGCGCTAAGACTGTTCTTGTTACGTCCGTGGATTCCGCTCTTGGATTGCAGATTGCGACATACCTAAGCAGCAAGGGTTGGCGAGTTATAGCCGGCTGTCGACAGGGCGGCCTCGGCGCTCGTCTAGCCGAATCCTGGCTTCAAGCACATGTAGCCGCGACGCCGGAGAATCAAGCTCCCCCGAGGCTAGCGACTTTAGAACTGGATGTGGCTAGAGAAGATTTATTAGAGGAAGCGGCCAGAGCTACGGCCCAACATCTCCCCGCTGGAGAACATGGAGTTTGGGCAGTGATCAATACAGCTGGTAGCAGTGGTCTCGGTGGGGCTTCGGTCTGGGAGAGTGCCCTTCGTTGTAATATCCTTGGAGCCCTAAGGGTCGCCAGGACATTCTCACCGCTTCTGGCCGCTGCAGCTGCAGACCACCCTTATGCTGGGCGACTGTTTTATATTGGTCTTACATCAGACACGGCCTGTGAAAGCCTATCCCGTGGTGAGAGCGAAGGCAGCGCATGTTCTGCCGCCGTAAGATGGGGCACTTGGGGCGCTGCTCGTGCATTGCGCGCTACTTTGCGTGCACGGAGGCTCCACGTCGTCCTCCTGCACGCGCCTGATCTAGCTGCCGAGGAAATATACGCACCACCGATGCAGATCACGCCTATAAGCCAACCGTCAAGCCGCCCGGATACACCGAATTCTGAAGTAAGCTCGTCGTCTACAGCTACCTGTGCGGTGACCATGCCTGGTGAGGCTGCGGAGTACAGCGCTAAAGTCTTACCGACCAGTGCTTTGAAGGTTCTAGAGGAAGCTCTGACGTCACCATCCCCGAGAGACTCCTATTATTTGAAAATCAAACAAGATTCTTGGTTTACAAGGATGCCATCTCTGAGAGTATCCCATTGA

Protein sequence:

>DPOGS206668-PA
MDPLTWLSLGLQLAALCSIVGALLLYLLRKVRVAEVLPVDSAKTVLVTSVDSALGLQIATYLSSKGWRVIAGCRQGGLGARLAESWLQAHVAATPENQAPPRLATLELDVAREDLLEEAARATAQHLPAGEHGVWAVINTAGSSGLGGASVWESALRCNILGALRVARTFSPLLAAAAADHPYAGRLFYIGLTSDTACESLSRGESEGSACSAAVRWGTWGAARALRATLRARRLHVVLLHAPDLAAEEIYAPPMQITPISQPSSRPDTPNSEVSSSSTATCAVTMPGEAAEYSAKVLPTSALKVLEEALTSPSPRDSYYLKIKQDSWFTRMPSLRVSH-