Monarch geneset OGS2.0

DPOGS210282
TranscriptDPOGS210282-TA1032 bp
ProteinDPOGS210282-PA343 aa
Genomic positionDPSCF300216 + 212196-223024
RNAseq coverage451x (Rank: top 27%)
Annotation
HeliconiusHMEL0225651e-8381.11% 
BombyxBGIBMGA001068-TA1e-10055.39% 
DrosophilaCG6028-PA1e-6341.28% 
EBI UniRef50UniRef50_Q2F6001e-11459.59%Fumarylacetoacetate hydrolase isoform B n=2 Tax=Obtectomera RepID=Q2F600_BOMMO
NCBI RefSeqNP_001103763.14e-7470.11%fumarylacetoacetase [Bombyx mori]
NCBI nr blastpgi|872483295e-11459.59%fumarylacetoacetate hydrolase isoform B [Bombyx mori]
NCBI nr blastxgi|872483292e-11159.77%fumarylacetoacetate hydrolase isoform B [Bombyx mori]
Group
Gene OntologyGO:00081521.8e-64metabolic process
GO:00038241.8e-64catalytic activity
KEGG pathway 
InterPro domain[183-341] IPR0112341.8e-64Fumarylacetoacetase, C-terminal-related
[181-340] IPR0025291.2e-50Fumarylacetoacetase, C-terminal-like
Orthology groupMCL11013 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210282-TA
ATGAAGCTAGTACAGTTTGTTTATAATGAAAATCCAGGAGAAATCCGCGCCGGTTATCTCGAAGGCGACAAAGTCGTGGACATAAACAAAACGGATTGCAGTATACCAAGTACTCTGTTGGAGATACTAAAGAACGGTGACTTGGAGAAAGTTAAGAAACTACCACTTATGAAACCACCCTCAATAGATCTAAGTTCCATTAAATTGGCAGCGCCAATACATGGCCATGACAAAGTGCTCTGCATCGGACTCAATTATAAGGATCACTGCGAGGAGCAGAATCTGACACCACCCCAAGTTCCGATGATTTTCAGCAAGTTCGCCAGCACAGTGGTTGGACCCAAGGACGCTGTAAAGCTACGGACAGATGTCACTAATAAAGTGGATTGGGAAGTCGAATTAACTGTGGTGATCGGCAAACCCGCGAACAGAGTCAAAGCTAAAGATGCGTATAAATATGTCCTTGGATACACGGTGGCACAGGACATCAGTGCAAGGGACTGGCAGAAGGAGAGAAACGGTGGACAATTCCTTCTGGGAAAGAAAGTGGACTGGGAAGTCGAATTAACTGTGGTGATCGGCAAACCCGCGAACAGAGTCAAAGCTAAAGATGCGTATAAATATGTCCTTGGATACACGGTGGCACAGGACATCAGTGCAAGGGACTGGCAGAAGGAGAGGAACGGTGGACAATTCCTTCTGGGAAAGTCAATGGACACCTTCTGCCCCATCGGTCCCTGTATCACCACCAGCGACGAGATCCCCGATCCTCAGAGCCTGTACATCAAATGCAGCGTGAATGGTGTTGAGAAACAGAAGAGCAACACAAACCAATTAGTTCATAAAATACCAGACGTCATAGAGAGACTGAGCTCGGTAATGACTCTCCTCCCTGGGGACATCCTTCTAACTGGCACGCCAGGGGGTGTGGGAATGTACAGATCACCTCCAGAGTATTTAAAGCCCGGTGACGTCATACACAGCGAAATAGAGAAGATCGGAGTACTGGAAACCAGAGTAGAGCAGCTCTAG

Protein sequence:

>DPOGS210282-PA
MKLVQFVYNENPGEIRAGYLEGDKVVDINKTDCSIPSTLLEILKNGDLEKVKKLPLMKPPSIDLSSIKLAAPIHGHDKVLCIGLNYKDHCEEQNLTPPQVPMIFSKFASTVVGPKDAVKLRTDVTNKVDWEVELTVVIGKPANRVKAKDAYKYVLGYTVAQDISARDWQKERNGGQFLLGKKVDWEVELTVVIGKPANRVKAKDAYKYVLGYTVAQDISARDWQKERNGGQFLLGKSMDTFCPIGPCITTSDEIPDPQSLYIKCSVNGVEKQKSNTNQLVHKIPDVIERLSSVMTLLPGDILLTGTPGGVGMYRSPPEYLKPGDVIHSEIEKIGVLETRVEQL-