Monarch geneset OGS2.0

DPOGS215648
Transcript: DPOGS215648-TA (993 bp)
Protein: DPOGS215648-PA (330 aa)
Genomic position: DPSCF300041 - 1532169-1535113
RNAseq coverage: 0x (Rank: top 99%)
Annotation
Heliconius      HMEL014114       4e-175  90.61%
Bombyx                                   %
Drosophila      wal-PB           4e-144  75.76%
EBI UniRef50    UniRef50_P13804  6e-124  67.49%  Electron transfer flavoprotein subunit alpha, mitochondrial n=44 Tax=root RepID=ETFA_HUMAN
NCBI RefSeq     XP_969306.1      2e-149  77.11%  PREDICTED: similar to cxpwmw03 [Tribolium castaneum]
NCBI nr blastp  gi|91075966      5e-148  77.11%  PREDICTED: similar to cxpwmw03 [Tribolium castaneum]
NCBI nr blastx  gi|91075966      7e-142  77.11%  PREDICTED: similar to cxpwmw03 [Tribolium castaneum]
Group
Gene Ontology    GO:0009055  5.9e-144  electron carrier activity
                 GO:0050660  5.9e-144  flavin adenine dinucleotide binding
KEGG pathway
InterPro domain  [18-331]   IPR001308  5.9e-144  Electron transfer flavoprotein, alpha subunit
                 [19-203]   IPR014729  5e-65     Rossmann-like alpha/beta/alpha sandwich fold
                 [21-203]   IPR014730  5e-51     Electron transfer flavoprotein, alpha/beta-subunit, N-terminal
                 [210-294]  IPR014731  1.1e-38   Electron transfer flavoprotein, alpha subunit, C-terminal
Orthology group  MCL12698  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215648-TA
ATGTTTTCACCGTGCTCTAGGCACTTGTTTTTGTCTTCTCAGCTGCGTCGTCTTCAAAGTACGTTGATTGTAGCTGAACACAACAATGAGTCGCTCTTACCAGCAACACAAAATGCCCTGAACGCTGCAAAGAAAATCGGTGGAGAAGTTTCTGTGCTCGTTGCTGGTACCAAATGTGGACCAGCCGCCGAAGCGATTTCTAAAGCAGGTGGAGTGTCAAAAGTGTTGGTAGCAGAAAACGAGGCTTTTAAGGGCTTTACCTCTGAATCGTTGACATCTTTGATATTGGCCACTCAGAAGCAATTCAACTTCACACATATATTGGCACCAGCGACAGCATTTGGCAAGGCCCTCCTACCTCGCGTTGCTGCCAAACTTGATGTCTCACCCATCACTGATATCATTGGTGTTAAAGATGCTAACACCTTTATAAGGACAATTTATGCTGGTAATGCAGTTTTAACCTTAGAGGCTAAGGATCCCATCAAGGTTATCACGGTAAGAGGAACAGCATTTCCAGCAGAACCTTTGGAAGGTGGTTCAGCGAGTGTGGAGAAGGCCCCCGAGGGAGATTACAAAACCGATCTCACTCAGTGGGTGTCACAGGAGATAACGAAATCTGATCGGCCGGAATTGACCAGTGCTAAGAATATTGTATCTGGAGGTCGTGGTCTCAAGTCTGGTGAGAACTTCAAGCTCCTGTATGATCTAGCTGATAAGTTGAATGCAGCGGTTGGGGCTTCCCGTGCTGCAGTTGACGCGGGCTTTGTACCCAACGACCTGCAGATCGGTCAGACGGGAAAAATTGTCGCTCCTGATCTATACATAGCTGTTGGCATCAGTGGAGCCATTCAACATCTGGCTGGTATGAAGGACTCCAAGACCATTGTTGCCATCAATAAAGACCCCGAAGCACCCATCTTTCAGGTATCTGACCTAGGTCTCGTTGCAGACTTGTTCAAGGCAGTCCCCGAACTTACTTCTAAGTTGTAA

Protein sequence:

>DPOGS215648-PA
MFSPCSRHLFLSSQLRRLQSTLIVAEHNNESLLPATQNALNAAKKIGGEVSVLVAGTKCGPAAEAISKAGGVSKVLVAENEAFKGFTSESLTSLILATQKQFNFTHILAPATAFGKALLPRVAAKLDVSPITDIIGVKDANTFIRTIYAGNAVLTLEAKDPIKVITVRGTAFPAEPLEGGSASVEKAPEGDYKTDLTQWVSQEITKSDRPELTSAKNIVSGGRGLKSGENFKLLYDLADKLNAAVGASRAAVDAGFVPNDLQIGQTGKIVAPDLYIAVGISGAIQHLAGMKDSKTIVAINKDPEAPIFQVSDLGLVADLFKAVPELTSKL-
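
Consistency check (a minimal sketch, not part of the original record): the 993 bp transcript corresponds to 330 codons plus one stop codon, and the protein entry marks that stop with a trailing "-". The snippet below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The helper name translate and its stop_symbol parameter are hypothetical, introduced only for illustration. It is run on the first and last few codons of DPOGS215648-TA rather than on the full sequence.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), written as the
# usual 64-letter string with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in the protein entry of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five codons and last four codons of DPOGS215648-TA.
print(translate("ATGTTTTCACCGTGC"))  # MFSPC
print(translate("TCTAAGTTGTAA"))     # SKL-

# Length bookkeeping: 330 residues plus one stop codon gives 993 nt.
print(3 * (330 + 1))                 # 993
```

Passing the full nucleotide sequence to translate and comparing the result with the DPOGS215648-PA entry would extend the same check to the whole record.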