Monarch geneset OGS2.0

DPOGS202962
Transcript: DPOGS202962-TA; 1245 bp
Protein: DPOGS202962-PA; 414 aa
Genomic position: DPSCF300195 + 518110-519354
RNAseq coverage: 119x (Rank: top 58%)
Annotation
Heliconius: HMEL011123; 2e-157; 68.03%
Bombyx: BGIBMGA005751-TA; 2e-134; 60.00%
Drosophila: CG5321-PB; 3e-103; 43.28%
EBI UniRef50: UniRef50_Q17KD9; 1e-103; 45.21%; Epsilon-trimethyllysine 2-oxoglutarate dioxygenase n=4 Tax=Diptera RepID=Q17KD9_AEDAE
NCBI RefSeq: XP_001653834.1; 3e-104; 45.21%; epsilon-trimethyllysine 2-oxoglutarate dioxygenase [Aedes aegypti]
NCBI nr blastp: gi|157123439; 5e-103; 45.21%; epsilon-trimethyllysine 2-oxoglutarate dioxygenase [Aedes aegypti]
NCBI nr blastx: gi|195457288; 1e-101; 48.07%; GK14708 [Drosophila willistoni]
Group
Gene Ontology: GO:0055114; 1.2e-35; oxidation-reduction process
Gene Ontology: GO:0016491; 1.2e-35; oxidoreductase activity
KEGG pathway: dpo:Dpse_GA18803; 1e-101
KEGG pathway: K00471 (E1.14.11.1) maps-> Lysine degradation
InterPro domain: [157-391] IPR003819; 1.2e-35; Taurine catabolism dioxygenase TauD/TfdA
Orthology group: MCL17267; Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202962-TA
ATGATTCTCAGACGGTTATATACTAGCAAACATTATTTTAGAAATACAGTAAATGTGAAAAATGTGACAAATAAGTTTTATTCAAACGACGTAAAGAGCAAAACTAAGAGAGATATATTAGATTTAAATATTCGCGGCGAAACCTATTCATTCCCTCACGTCTGGTTACGGGACAACTGCCAGTGCGACCAATGTTTCCATAAATTTGCTCAAAGCCGGATTATTGATTGGAGCAAATTCAACTTGAACAGCTTCCCGAAAGATGTCGTCAAAGATGAAAATTCCGTAAGAATCACTTGGGATGATGGCCACATATCCCAATATACCACGAACTGGTTACTGTTTCGAAGTTTTAACCGTGATAAACAACAGTATTACGACAACTACATCTATAAACCTGAAAAAATTGTTTGGCATGGGAAAGAATTTAGTAAAATTTGCACTAAACACGACTACAAAGCCATAGTAGAAACGGATGAGGCTCTTTATAAGTGGTTGTACAATTTATCAACATATGGAGTGGCTCTCATAGAAAACACACCACCTTCTGAGCATTCCATGGAATCTATTGTTGATAGGATAGGGTTTAAAAAGAGGACACACTATGGACATAATTTCATTGTTCAAAATGTTCCAAACACTAATAACGTTGCATATCTATCGAGCACTTTACAGTTGCACACAGACCTTCCATATTATGAATACTGTCCGGGAGCTAATTTACTGCACTGTCTAGTCCAGACCAATAGCAAAGGAGGAGAAAACTTACTATCAGACTGTCATTACACTGCCAGATACATGAAACAAAAGCATCCCGATGCATATAAATTATTAACGGAAGTGGAAGTAGAGTGGAGCGACATAGGAACTGAGCAAGGGAATGAGTTCTACAAACTGCATAGAGGTCCAGTTATTTGTACCGACAAGCACGGAGACATAACCAGGATTAACTTTTCTATACCTCAGAGAGGTAGCTATCTGCCAATACCTATTGAATTGGTCAAACCATGGTTTGAAGCTCACTCAATGTTTTTTGAATTTAATACAAAATTTTCTGCAAACTTTAAGACAAAATCTGGAGATATTCTTACTTTTGACAACATAAGGTTATTGCATGGCAGGAACCAGTACAAAGACTCTGCTGAAAATGTAAGGAAATTGATAGGAGCTTATGTGGATTGGGATGAAATATACTCAAGACTCAGATGTTTAAAAGTGAAACTACATCCGGAGAATACTTCTTAG

Protein sequence:

>DPOGS202962-PA
MILRRLYTSKHYFRNTVNVKNVTNKFYSNDVKSKTKRDILDLNIRGETYSFPHVWLRDNCQCDQCFHKFAQSRIIDWSKFNLNSFPKDVVKDENSVRITWDDGHISQYTTNWLLFRSFNRDKQQYYDNYIYKPEKIVWHGKEFSKICTKHDYKAIVETDEALYKWLYNLSTYGVALIENTPPSEHSMESIVDRIGFKKRTHYGHNFIVQNVPNTNNVAYLSSTLQLHTDLPYYEYCPGANLLHCLVQTNSKGGENLLSDCHYTARYMKQKHPDAYKLLTEVEVEWSDIGTEQGNEFYKLHRGPVICTDKHGDITRINFSIPQRGSYLPIPIELVKPWFEAHSMFFEFNTKFSANFKTKSGDILTFDNIRLLHGRNQYKDSAENVRKLIGAYVDWDEIYSRLRCLKVKLHPENTS-
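
The lengths and coordinates in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. The helper names (`translate`, `span_length`) and the constants are illustrative assumptions. It uses the standard genetic code and only the first 18 and last 15 bases of the nucleotide sequence above, so it does not confirm the full translation.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Values copied from the record above.
TRANSCRIPT_BP = 1245
PROTEIN_AA = 414
GENOMIC_START, GENOMIC_END = 518110, 519354

# First 18 and last 15 bases of DPOGS202962-TA, copied from the FASTA entry.
CDS_HEAD = "ATGATTCTCAGACGGTTA"
CDS_TAIL = "GAGAATACTTCTTAG"

codons = TRANSCRIPT_BP // 3
print("whole codons:", codons, "remainder:", TRANSCRIPT_BP % 3)
print("residues excluding stop:", codons - 1, "reported:", PROTEIN_AA)
print("genomic span:", span_length(GENOMIC_START, GENOMIC_END))
print("N-terminus:", translate(CDS_HEAD))
print("C-terminus:", translate(CDS_TAIL))
```

The transcript length is 3 x 415, which matches 414 residues plus one stop codon, and the genomic span equals the transcript length, which is consistent with a single uninterrupted exon. The trailing `-` in the protein FASTA entry corresponds to the terminal stop codon, which this sketch prints as `*`.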