Monarch geneset OGS2.0

DPOGS215598
TranscriptDPOGS215598-TA1422 bp
ProteinDPOGS215598-PA473 aa
Genomic positionDPSCF300097 + 286821-292780
RNAseq coverage163x (Rank: top 52%)
Annotation
HeliconiusHMEL0223678e-12457.57% 
BombyxBGIBMGA008821-TA0.073.02% 
DrosophilaCG5599-PA5e-12449.00% 
EBI UniRef50UniRef50_P533952e-12954.55%Lipoamide acyltransferase component of branched-chain alpha-keto acid dehydrogenase complex, mitochondrial n=10 Tax=Euteleostomi RepID=ODB2_MOUSE
NCBI RefSeqXP_310535.46e-12754.87%AGAP000549-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2240573232e-13255.43%PREDICTED: dihydrolipoamide branched chain transacylase E2 [Taeniopygia guttata]
NCBI nr blastxgi|3123848551e-12655.23%hypothetical protein AND_01468 [Anopheles darlingi]
Group
Gene OntologyGO:00469491.4e-207fatty-acyl-CoA biosynthetic process
GO:00480371.4e-207cofactor binding
GO:00437541.4e-207dihydrolipoyllysine-residue (2-methylpropanoyl)transferase activity
GO:00084154.7e-78acyltransferase activity
GO:00081524.7e-78metabolic process
KEGG pathwaytgu:1002292153e-133 
 K09699 (E2.3.1.168, bkdB)maps-> Valine, leucine and isoleucine degradation
InterPro domain[10-473] IPR0157611.4e-207Lipoamide Acyltransferase
[242-471] IPR0232133.7e-87Chloramphenicol acetyltransferase-like domain
[243-471] IPR0010784.7e-782-oxoacid dehydrogenase acyltransferase, catalytic domain
[56-156] IPR0110533.3e-22Single hybrid motif
[63-134] IPR0000894.2e-18Biotin/lipoyl attachment
[162-207] IPR0041673e-14E3 binding
Orthology groupMCL13123 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215598-TA
ATGGCATTCGCAACTAGAAGATTTATTTTGCTTAACTTAAGGAATTTGAAATATAGTCTGTATTCTGCAAATGCCAGAAAAAACACTCTCAGACATTACTCGAAAGCACTGGTGTCATTTAAAATTGACGATAAACAGAATACGAGAAGGTTGCAAACAACAAACACATACAACAAGAGCGTTGCCTTCAAACTATCGGATATTGGTGAAGGCATACGAGAAGTTGTAGTTAAAGAATGGTACATCAAAGTAGGAGACAAAGTTCAGCAATTTGACAACATATGCGAAGTCCAGAGCGATAAAGCAGCGGTAACAATCAGTAGCCGATACGACGGTGTTGTCACCAAACTATACTATGAAGTTGATCAAACAGCCCTCGTTGGTCAACCTCTAGTTGACATAGAAGTAGAAGATGCAGAGGAAGACTCGTCTCAAAAGAGTGCTATTCCTGAAATAACTAAAGAAGTTCCGAAATCAGAAGTTAAAAGTGAACGAATCAAGGTATTGACAACACCAGCTGTAAGAAGAATTGCTGCACAGTTCAGAGTAGACTTGAGTAACGTTAACGCCACTGGCAAAAACGGAAGAGTACTTAAAGAAGATGTTTTATCCCATTTAAATATGAGTTCTGATAAATCTAATGATATACCACAGAATGACTTGTCAGTTGAAGCTTTGTCAATACCTGTCACAACTGGTTTTGCTAAAATGGAAACTATAGTGGAAGATAAAATAGTTCCCATCACAGGCTTTACAAAGGCTATGGTAAAATCTATGACAGAAGCTATGAAAATACCTCATTTTGTATTTAGTGACGAATACGATGTAACAAAATTAGTAGAATCAAGAGAAAATCTGAAAATTATGGCTAAAAATAGAGGAGTTAAATTAACCTATATGCCCATAATAATAAAAGCTGCGTCGCTGAGTATTGCTAAGTATCCAATAATAAACAGCAGTCCTGACAGCAATTGTGAAAACATTATATACAAAGCCAGTCATAATATTGGAGTGGCGATGAACACTCCTAACGGTTTAGTGGTGCCAGTTATAAAGAACGTTCAAAATAAAAATATTATTGAATTAGCGAGAGAACTGAATTCGCTCCAAGAAAAAGGCTCTAAAGGACAATTTGGATTTAATGATCTAAGTGGAGGAACCTTCACTATTTCAAACATTGGAATTGTAGGTGGAACTTATACAAAGCCCATAATATTTTCGCCACAAGTGTCGATTGGTGCTCTAGGAAAGATTCAGGTTTTGCCTAGATTCGATTCAGAAGGTAACGTAGTAAAGGCTCACATATTATCTGTGAGTTTTGCAGCTGACCATAGGATTATCGACGGAGTCACTATGGCAAGTTTTTCGAATCAACTAAAGGAATATCTAGAGAATCCACAAGTACTACTTTTAGATCTGTGA

Protein sequence:

>DPOGS215598-PA
MAFATRRFILLNLRNLKYSLYSANARKNTLRHYSKALVSFKIDDKQNTRRLQTTNTYNKSVAFKLSDIGEGIREVVVKEWYIKVGDKVQQFDNICEVQSDKAAVTISSRYDGVVTKLYYEVDQTALVGQPLVDIEVEDAEEDSSQKSAIPEITKEVPKSEVKSERIKVLTTPAVRRIAAQFRVDLSNVNATGKNGRVLKEDVLSHLNMSSDKSNDIPQNDLSVEALSIPVTTGFAKMETIVEDKIVPITGFTKAMVKSMTEAMKIPHFVFSDEYDVTKLVESRENLKIMAKNRGVKLTYMPIIIKAASLSIAKYPIINSSPDSNCENIIYKASHNIGVAMNTPNGLVVPVIKNVQNKNIIELARELNSLQEKGSKGQFGFNDLSGGTFTISNIGIVGGTYTKPIIFSPQVSIGALGKIQVLPRFDSEGNVVKAHILSVSFAADHRIIDGVTMASFSNQLKEYLENPQVLLLDL-