Monarch geneset OGS2.0

DPOGS204429
Transcript         DPOGS204429-TA   1269 bp
Protein            DPOGS204429-PA   422 aa
Genomic position   DPSCF300002 - 376001-379613
RNAseq coverage    212x (Rank: top 46%)
Annotation
Heliconius       HMEL006242         3e-150   87.27%
Bombyx           BGIBMGA007726-TA   0.0      87.26%
Drosophila       CG4908-PA          0.0      71.19%
EBI UniRef50     UniRef50_Q9Y276    5e-153   63.21%   Mitochondrial chaperone BCS1 n=96 Tax=Eukaryota RepID=BCS1_HUMAN
NCBI RefSeq      XP_001863575.1     0.0      75.30%   mitochondrial chaperone BCS1 [Culex quinquefasciatus]
NCBI nr blastp   gi|312372442       0.0      76.25%   hypothetical protein AND_20171 [Anopheles darlingi]
NCBI nr blastx   gi|91082057        0.0      77.78%   PREDICTED: similar to AGAP004266-PA [Tribolium castaneum]
Group
Gene Ontology     GO:0005524   5.5e-20   ATP binding
                  GO:0000166   2.4e-12   nucleotide binding
                  GO:0017111   2.4e-12   nucleoside-triphosphatase activity
KEGG pathway      (none listed)
InterPro domain   [23-192]    IPR014851   2.7e-49   BCS1, N-terminal
                  [228-355]   IPR003959   5.5e-20   ATPase, AAA-type, core
                  [223-358]   IPR003593   2.4e-12   ATPase, AAA+ type, core
Orthology group   MCL12282   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204429-TA
ATGACACTAACAGAATATGTAGCGTCATTGTCACAAAACCCATACTTTGGGGCTGGATTTGGCTTGTTTGGAATTGGTGCGGGAGCGGCTATTTTAAGGAAAGGATTCCAAACGTCTATGATCCTATTTCGAAGACATTGTATGATAACTCTAGAGGTTCCGTGTCGCGATAAGTCCTACCAGTGGTTGTTACACTGGATCACACAAAAGGGTGCAAAACAAACACAACATCTTAGTGTGGAAACATCATTTTTGCAAAAGGATACAGGGCAGATAAAAACAAAATATGACTTTATACCAAGCGTCGGCCAACATTTCTTTAGATATGGCGGTACATGGATAAGAGTTGATAGAACCAGAGAACAGCAAACCATAGATTTACACATGGGCATACCATTTGAACACGTCACTCTAACTGCTTTTGGCCGCAATAAAGAAATATACTACAATATACTTGAGGATGCTAGAACTATGGCTCTTAAACAGCACGAGGGTATGACAGTCATGTATACAGCTATGGGATCCGAATGGAGAACTTTTGGTCATCCTCGGAAGCGACGACCTTTACATAGCGTCATACTGCGATCTGGGCTCACAGAAAAAATACTTACTGACTGTCTTGACTTCATCGACAATCCCAACTGGTATACTGATAGAGGAATTCCTTACCGGAGAGGTTATTTACTGTACGGACCCCCAGGTTGTGGTAAATCGTCATTTATAACAGCTTTAGCAGGACAGCTGGAATATAACATATGTGTTCTAAATTTATCTGAAAGAGGACTAACTGATGATAGACTCAATCATTTACTGAGTGTAGCTCCCCAACAATCAATAATTCTTCTAGAAGACATCGACGCTGCGTTCGTATCCCGTGAAGATACTCCCAAACAGAAAGCCGCCTTCGAAGGCTTAAATAGGGTTACTTTTAGCGGGCTCCTAAATTGTTTAGACGGAGTCGCCTCAACTGAAGCTCGGATTGTTTTTATGACCACAAATTATTTGGAAAGATTGGATCCAGCATTAATTAGACCAGGTAGAGTTGATATGAAAGAGTATGTTGGGTACTGCGACCAGGCCCAAGTAGAGCTCATGTTTCTAAGGTTTTACAAAGACGCCGATGAACACGCTAAAAGCTTTGCACAAAAAGTTATGGATTACAAAAAAGATGTCAGCCCAGCTCAAATACAAGGCTACTTCATGTTCCATAAATATTCAACACCAGAGGAAGTCCTCACGAATGTTGGTACTATATGGACTCTCGGGTAA

Protein sequence:

>DPOGS204429-PA
MTLTEYVASLSQNPYFGAGFGLFGIGAGAAILRKGFQTSMILFRRHCMITLEVPCRDKSYQWLLHWITQKGAKQTQHLSVETSFLQKDTGQIKTKYDFIPSVGQHFFRYGGTWIRVDRTREQQTIDLHMGIPFEHVTLTAFGRNKEIYYNILEDARTMALKQHEGMTVMYTAMGSEWRTFGHPRKRRPLHSVILRSGLTEKILTDCLDFIDNPNWYTDRGIPYRRGYLLYGPPGCGKSSFITALAGQLEYNICVLNLSERGLTDDRLNHLLSVAPQQSIILLEDIDAAFVSREDTPKQKAAFEGLNRVTFSGLLNCLDGVASTEARIVFMTTNYLERLDPALIRPGRVDMKEYVGYCDQAQVELMFLRFYKDADEHAKSFAQKVMDYKKDVSPAQIQGYFMFHKYSTPEEVLTNVGTIWTLG-
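
The lengths reported in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that assumes the standard genetic code and assumes the trailing "-" in the protein sequence marks the stop codon. The names `translate`, `CODON_TABLE`, and the constants are hypothetical and introduced only for illustration. It confirms that the 1269 bp transcript corresponds to 422 codons plus one stop codon, and that the first and last codons of DPOGS204429-TA translate to the start and end of DPOGS204429-PA.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    seen in the protein sequence of this record (an assumption).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Values copied from the record above.
TRANSCRIPT_BP = 1269
PROTEIN_AA = 422
GENOMIC_START, GENOMIC_END = 376001, 379613

# 422 residues plus one stop codon should account for the whole transcript.
cds_is_consistent = TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)

# Inclusive span of the gene on scaffold DPSCF300002.
genomic_span = GENOMIC_END - GENOMIC_START + 1

# First seven and last three codons of DPOGS204429-TA.
first_residues = translate("ATGACACTAACAGAATATGTA")
last_residues = translate("CTCGGGTAA")

print(cds_is_consistent, genomic_span, first_residues, last_residues)
```

The genomic span is longer than the transcript, which would be consistent with the gene containing introns, although the record itself does not list exon coordinates.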