Monarch geneset OGS2.0

DPOGS205173
Transcript: DPOGS205173-TA, 1113 bp
Protein: DPOGS205173-PA, 370 aa
Genomic position: DPSCF300197, 49735-52701
RNAseq coverage: 59x (Rank: top 68%)
Annotation
Heliconius      HMEL021813        2e-114  58.15%
Bombyx          BGIBMGA001273-TA  1e-94   67.24%
Drosophila      CG4335-PA         4e-68   36.97%
EBI UniRef50    UniRef50_E0W3Z1   4e-81   44.02%  Trimethyllysine dioxygenase, putative n=1 Tax=Pediculus humanus corporis RepID=E0W3Z1_PEDHC
NCBI RefSeq     XP_002433085.1    7e-82   44.02%  trimethyllysine dioxygenase, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242025345      1e-80   44.02%  trimethyllysine dioxygenase, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|242025345      2e-80   44.02%  trimethyllysine dioxygenase, putative [Pediculus humanus corporis]
Group
Gene Ontology
GO:0050353  3.4e-119  trimethyllysine dioxygenase activity
GO:0005506  3.4e-119  iron ion binding
GO:0045329  3.4e-119  carnitine biosynthetic process
GO:0055114  3.4e-119  oxidation-reduction process
GO:0031418  3.4e-119  L-ascorbic acid binding
GO:0016491  2e-38     oxidoreductase activity
KEGG pathway
dme:Dmel_CG4335  3e-66
K00471 (E1.14.11.1) maps-> Lysine degradation
InterPro domain
[19-369]   IPR012776  3.4e-119  Trimethyllysine dioxygenase
[128-353]  IPR003819  2e-38     Taurine catabolism dioxygenase TauD/TfdA
Orthology group: MCL17294 (Patchy)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205173-TA
ATGTCTGCTATAACCGATGTTAATAAATTTGATAGAGCTCTAAATATAAAATTTGAAAATGGAACTTCTTTAAGTGTTGAAGATTGCTGGCTTCGAGATAATTGCAGATGTTCGTCTTGCTATCGCTCTGATACTTTTCAAAAAGCTAATTACCTATTAGAACTATCAGGCATCAAGATAAGCACTTTTAAATTCGACGCTTGTAATCTATACATCGAATGGAATGACAAACATCAGTCGACATACGAAGCATCTTTTTTAATAAAATTTCAATATGATAAATGGAAAGACAGTAGAAAACTAAGACCAATATTATGGAAAGGTGTGGACATTGAAAATAAAATAGCAAAGGTTCATGCCCGTGAATTCCTTAACACCGAGAGTGGTGCTATCACAGTATTCAAATCTCTAATAGACTACGGCGTTGCTTTAATTGATCAGGTAGGTGTATCTCTGGAAGCTACGGAACAGGTGTGCAAAGCGTTAGGTGGTGTACAACATACCTTATTTGGAGGCATGTGGCAGTTTACCTCCGTCCAGGACCACGCTGACACCGCTTACACGAATCTGCCATTAGCTGTGCACAATGATAACACTTACTTCAACGAGGCAGCTGGCCTCCAGATACTGCATTGCCTGGAACATTCCAATGGAGAAGGCGGAGAAACAATTTTAGTTGATGGTTTTTTTGGAGCTACGAAATTGAAAAATGAATTCCCTGAGGACTTTGAATTCTTGACTAATTTTAATTTAGACGCGGAATATTTGGAAGATGGTCATCATTATAAATATTCGGCACCTGTTATAAATGTTGACAAGGAAAAGAATTTGGAACAAATAAGGTTTAATGTTTACGACCGCTCACCGATGGCATTCTCTAGTGGTGAGGAATGCAGGTCATACTACCGCGCGCTATGGAATCTCTCGACCTATTACCAAAATACCGACTGTCAATGGAAATTTAAATTATACCCAGGTTTGGTGATGGTGATGGACAACTTCAGAGTACTTCATGGAAGAACAGCTTTCACAGGAAATCGCATAGTATGCGGTAGTTACGTGTCACGTTCCGATTGGCTTAATAAAGCACGAGCTTTGAAACTTATACAATAG

Protein sequence:

>DPOGS205173-PA
MSAITDVNKFDRALNIKFENGTSLSVEDCWLRDNCRCSSCYRSDTFQKANYLLELSGIKISTFKFDACNLYIEWNDKHQSTYEASFLIKFQYDKWKDSRKLRPILWKGVDIENKIAKVHAREFLNTESGAITVFKSLIDYGVALIDQVGVSLEATEQVCKALGGVQHTLFGGMWQFTSVQDHADTAYTNLPLAVHNDNTYFNEAAGLQILHCLEHSNGEGGETILVDGFFGATKLKNEFPEDFEFLTNFNLDAEYLEDGHHYKYSAPVINVDKEKNLEQIRFNVYDRSPMAFSSGEECRSYYRALWNLSTYYQNTDCQWKFKLYPGLVMVMDNFRVLHGRTAFTGNRIVCGSYVSRSDWLNKARALKLIQ-