Monarch geneset OGS2.0

DPOGS212045
TranscriptDPOGS212045-TA1026 bp
ProteinDPOGS212045-PA341 aa
Genomic positionDPSCF300054 + 474815-476151
RNAseq coverage95x (Rank: top 62%)
Annotation
HeliconiusHMEL0129707e-10574.79% 
BombyxBGIBMGA010193-TA6e-10272.69% 
DrosophilaCG31548-PA2e-6149.15% 
EBI UniRef50UniRef50_F4WC285e-6049.58%Tetratricopeptide repeat protein 27 n=2 Tax=Acromyrmex echinatior RepID=F4WC28_ACREC
NCBI RefSeqNP_001040154.14e-6853.78%short-chain dehydrogenease/reductase [Bombyx mori]
NCBI nr blastpgi|1140507718e-6753.78%short-chain dehydrogenease/reductase [Bombyx mori]
NCBI nr blastxgi|1140507715e-6553.78%short-chain dehydrogenease/reductase [Bombyx mori]
Group
Gene OntologyGO:00054889.7e-73binding
GO:00081523.7e-29metabolic process
GO:00164913.7e-29oxidoreductase activity
KEGG pathwayrmr:Rmar_24509e-33 
 K00059 (fabG)maps-> Biosynthesis of unsaturated fatty acids
    Fatty acid biosynthesis
InterPro domain[1-237] IPR0160409.7e-73NAD(P)-binding domain
[7-24] IPR0023473e-40Glucose/ribitol dehydrogenase
[6-171] IPR0021983.7e-29Short-chain dehydrogenase/reductase SDR
Orthology groupMCL25711 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212045-TA
ATGGATTTCACAAATAAAGTAGTAGTTATAACTGGAGGAAGTTCAGGTATAGGAGCTGCTACAGCGATTTATTTTTCAAAACTTTCAGCACAATTAGTTTTAGTAGGTAGAAAAGAAAACAATCTCAAAAAAATCTCCTTGTATTGTGAAAAAGCTAAAGCAGTTAAGCCATTGCCTATCGTAGCAGATTTAACTGAAGATTCGGACGTGGAAAGAATCGTTACAGAAACTATAGACCATTTTGGAAAGATCGACGTTCTGATTAATAATGCTGGCGTTATGTCAATGGGAGGACTTAAGGAATCTAATATGGAAATGTATGACAAAGTTATGTCAACGAATATTCGTGCAGTATATTATTTAACCAAACTTTTTACTCCACATCTAATAGAAAGCAAGGGTTGTATTGTAAATGTTTCAAGTATTCTTGGAAGTAAAGTTACAACAAACGCCTTGGCATACAATATGTCAAAAGCTGCATTGGATCACTTCACAAAATGTGTCGCTTTGGAACTGGGACCAGATGGTGTAAGAGTAAACTCTGTAAATCCAGGTTTTGTGAAGACCAATCTCTTGAAAGATGTTGGGCTTTCTGAAGATCAACTTGAAATGTTGATGAAAAATATTGTGTGTAGAAATCCATTGAAACGACAGGTGGAAGGTGATGAAGTCGCTGCTCTGATTGCATTTCTTGCTAGTGATAAAGCAAAAAACAAGAAAAACAAGCCATTACAGTTCTTTATGAAACCTGAAAAACTAAGTCCTCAAGAACCGTCTAAGAAAAGATCAAAGAAATCCGAATACGAGGAACCGAAACAGAAAGAATACGACAAAAATGAGATGGATTTCGTTGAAGTGGATGAATGTAACGATCCTAGAATAATGAACGAGGATGAAGCGTTCTTCGCGTCACTCCTACCCTCTGTCGTGAAGTACAATGAGGACGAAAGACTCGAATTCCGCATGGAAGTGTTGGCCATCATGAAACGGATAAAGGACAAACGGAAATGGACCAATGACGTGTGA

Protein sequence:

>DPOGS212045-PA
MDFTNKVVVITGGSSGIGAATAIYFSKLSAQLVLVGRKENNLKKISLYCEKAKAVKPLPIVADLTEDSDVERIVTETIDHFGKIDVLINNAGVMSMGGLKESNMEMYDKVMSTNIRAVYYLTKLFTPHLIESKGCIVNVSSILGSKVTTNALAYNMSKAALDHFTKCVALELGPDGVRVNSVNPGFVKTNLLKDVGLSEDQLEMLMKNIVCRNPLKRQVEGDEVAALIAFLASDKAKNKKNKPLQFFMKPEKLSPQEPSKKRSKKSEYEEPKQKEYDKNEMDFVEVDECNDPRIMNEDEAFFASLLPSVVKYNEDERLEFRMEVLAIMKRIKDKRKWTNDV-