Monarch geneset OGS2.0

DPOGS211519
Transcript: DPOGS211519-TA, 1428 bp
Protein: DPOGS211519-PA, 465 aa
Genomic position: DPSCF300354 + 63667-65094
RNAseq coverage: 1690x (Rank: top 8%)
Annotation
Heliconius:      HMEL008126              0.0     80.98%
Bombyx:          BGIBMGA003815-TA        0.0     70.88%
Drosophila:      CG5214-PA               1e-116  77.91%
EBI UniRef50:    UniRef50_UPI000206466C  2e-120  75.00%  UPI000206466C related cluster n=1 Tax=unknown RepID=UPI000206466C
NCBI RefSeq:     XP_971313.2             6e-155  70.66%  PREDICTED: similar to dihydrolipoamide succinyltransferase component of 2-oxoglutarate dehydrogenase [Tribolium castaneum]
NCBI nr blastp:  gi|189239144            1e-153  70.66%  PREDICTED: similar to dihydrolipoamide succinyltransferase component of 2-oxoglutarate dehydrogenase [Tribolium castaneum]
NCBI nr blastx:  gi|270010782            1e-168  72.83%  hypothetical protein TcasGA2_TC010587 [Tribolium castaneum]
Group
Gene Ontology:
    GO:0006099  5.3e-153  tricarboxylic acid cycle
    GO:0045252  5.3e-153  oxoglutarate dehydrogenase complex
    GO:0004149  5.3e-153  dihydrolipoyllysine-residue succinyltransferase activity
    GO:0008415  1.5e-76   acyltransferase activity
    GO:0008152  1.5e-76   metabolic process
KEGG pathway: tca:659954  2e-154
    K00658 (DLST, sucB) maps to:
        Citrate cycle (TCA cycle)
        Lysine degradation
InterPro domain:
    [64-462]   IPR006255  5.3e-153  Dihydrolipoamide succinyltransferase
    [235-462]  IPR023213  8.2e-83   Chloramphenicol acetyltransferase-like domain
    [235-462]  IPR001078  1.5e-76   2-oxoacid dehydrogenase acyltransferase, catalytic domain
    [58-157]   IPR011053  1.5e-21   Single hybrid motif
    [64-135]   IPR000089  9e-17     Biotin/lipoyl attachment
Orthology group: MCL13056  Single-copy universal gene

Nucleotide sequence:

>DPOGS211519-TA
ATGTTGAGACGCTGCTCCAAACACCTGCAGACCATATATCGGCGGCAGGGTCACAGTTTCCGCTTTAGGTCGTCAGAGGTCACCAGGGTCTGCGCCGCCCTACCCAAAGCTACCCTCGCGAAGCCGAGAGTCTTATCCAGCACCCAAACAGCCTCCTTCCACTTCACAAACACCCTGTTAGCGGAACAGGATGTGATGACTCCGAGCTTCCCAGACTCCGTGTCGGAAGGTGACGTCAAGCTTGACAAAAAAGTTGGTGACGCTGTCGCCGCTGATGAGGTCGTGCTTGAAATCGAAACAGACAAAACCGCTATCCCGGTCATGGCGCCGGACAACGGTATCATCAAGGAGTTGTACGTCAAGGATGGAGAGACGGTGAAGGCTGGGCAGAAACTGTTCAGGCTGGAGATCACGGGGGCGGCTCCCAAAAAGGCAGCGCCTGCTGCGCCTGAACCTCCCAAGGAAGTGCCACCGCCGCCTCCAGCTGCAGCCGCGCCCCCACCACCAGCCGCTGTTCCCCCTCCACCGGCCGCAGTTCCCCCACCACCGGCTGCCGCCCCACCCCCCCAGCAGGCCCCTCCGAAACCAGCCGCCCCGATCTCGTCCATCCCAGTAGCAGCCATCCGTCACGCACAGGCGATAGAAACAGCCTCAGTGAAGGTCCCACCATCAGACTACAGCAAAGAAATCGTCGGGACCAGAAGCGAACAGCGCGTCAAGATGAACCGCATGAGGCTGCGTATAGCCGAGAGACTGAAGGACGCGCAGAACACGAACGCTCTGCTGACGACGTTCAACGAAATAGACATGTCCCATATCATGGCCTTCAGGAAGAAACATCTGGACGCGTTCACCAAAAAGCACGGAGTGAAGCTGGGTTTGATGTCGCCCTTCGTCAAGGCGTCCGCTACAGCTCTCATGGACCAGCCGGTCGTGAACGCTGTGATCGAAGGAAACGAGATTATTTACCGCGACTACGTGGACATATCGGTTGCCGTCGCCACGCCCAAGGGTCTGGTGGTGCCTGTCATAAGGAACGTCCACAACATGACCTACGCGGACATCGAGCTGAATATAGCTGAGCTGGCTGAAAAGGCGAGGAAAGGCAGGCTGACCATCGAGGAGATGGACGGGGGTACCTTCACTATCAGCAACGGGGGCGTCTTCGGGTCCCTGATGGGGACGCCCATCGTGAACCCGCCGCAGTCAGCGATCCTGGGCATGCACGGCATCTTCGAGCGTCCCATCGCTCTGAACGGTCAAGTGGTCATCAGACCTATGATGTACATAGCTCTAACTTACGACCACAGATTGATAGACGGACGCGAGGCCGTCATGTTCCTTAGGAAGATCAAGGAGGGGGTGGAGGATCCCGCCACGATCATCGCTGGCTTGTAAGACGACAGCAAAATTACAGAATTTTTTTAA

Protein sequence:

>DPOGS211519-PA
MLRRCSKHLQTIYRRQGHSFRFRSSEVTRVCAALPKATLAKPRVLSSTQTASFHFTNTLLAEQDVMTPSFPDSVSEGDVKLDKKVGDAVAADEVVLEIETDKTAIPVMAPDNGIIKELYVKDGETVKAGQKLFRLEITGAAPKKAAPAAPEPPKEVPPPPPAAAAPPPPAAVPPPPAAVPPPPAAAPPPQQAPPKPAAPISSIPVAAIRHAQAIETASVKVPPSDYSKEIVGTRSEQRVKMNRMRLRIAERLKDAQNTNALLTTFNEIDMSHIMAFRKKHLDAFTKKHGVKLGLMSPFVKASATALMDQPVVNAVIEGNEIIYRDYVDISVAVATPKGLVVPVIRNVHNMTYADIELNIAELAEKARKGRLTIEEMDGGTFTISNGGVFGSLMGTPIVNPPQSAILGMHGIFERPIALNGQVVIRPMMYIALTYDHRLIDGREAVMFLRKIKEGVEDPATIIAGL-
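
The lengths reported in this record can be cross-checked with simple arithmetic. The inclusive span 63667-65094 is 1428 bp, which matches the transcript length. A 465 aa protein plus one stop codon accounts for 1398 bp, so 30 bp of the transcript would follow the stop codon. The sketch below is illustrative only. It assumes the standard genetic code and an open reading frame that starts at the first base, and the helper names (`span_length`, `translate_to_stop`, `bases_after_stop`) are hypothetical and not part of any MonarchBase tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end - start + 1


def translate_to_stop(nucleotides: str) -> str:
    """Translate from the first base up to (not including) the first stop codon.

    Codons containing characters outside TCAG are rendered as 'X'.
    Any incomplete trailing codon is ignored.
    """
    seq = nucleotides.upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def bases_after_stop(transcript_len: int, protein_len: int) -> int:
    """Bases left after the coding codons and one stop codon."""
    return transcript_len - 3 * (protein_len + 1)


if __name__ == "__main__":
    print(span_length(63667, 65094))       # coordinate span in bp
    print(bases_after_stop(1428, 465))     # bases following the stop codon
    # First seven codons of DPOGS211519-TA with a stop codon appended.
    print(translate_to_stop("ATGTTGAGACGCTGCTCCAAATAA"))
```

Passing the full DPOGS211519-TA sequence to `translate_to_stop` should give a string that can be compared with DPOGS211519-PA once the trailing `-` (the stop marker) is removed.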