Monarch geneset OGS2.0

DPOGS209672
TranscriptDPOGS209672-TA2196 bp
ProteinDPOGS209672-PA731 aa
Genomic positionDPSCF300134 - 202491-211240
RNAseq coverage1160x (Rank: top 11%)
Annotation
HeliconiusHMEL0021659e-18064.94% 
BombyxBGIBMGA000703-TA2e-17365.60% 
DrosophilaPlod-PA0.050.50% 
EBI UniRef50UniRef50_E0VRF90.056.61%Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3, putative n=7 Tax=Coelomata RepID=E0VRF9_PEDHC
NCBI RefSeqXP_001601697.10.056.01%PREDICTED: similar to procollagen-lysine,2-oxoglutarate 5-dioxygenase [Nasonia vitripennis]
NCBI nr blastpgi|3504216780.054.57%PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like isoform 1 [Bombus impatiens]
NCBI nr blastxgi|3504216780.054.57%PREDICTED: procollagen-lysine,2-oxoglutarate 5-dioxygenase 3-like isoform 1 [Bombus impatiens]
Group
Gene OntologyGO:00167053.6e-21oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055063.6e-21iron ion binding
GO:00551143.6e-21oxidation-reduction process
GO:00314183.6e-21L-ascorbic acid binding
GO:00167062.8e-08oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
GO:00164912.8e-08oxidoreductase activity
KEGG pathwaynvi:1001174670.0 
 K13647 (PLODN)maps-> Lysine degradation
InterPro domain[556-730] IPR0066203.6e-21Prolyl 4-hydroxylase, alpha subunit
[645-731] IPR0051232.8e-08Oxoglutarate/iron-dependent oxygenase
Orthology groupMCL10562 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209672-TA
ATGACCGGCCTCCTGCAATTATTCTGTCTTTTTCTTCTATTTACTAGTAAAACTGCAGGAACTACAAAAAGTAAAAGCCAACAACCCGATGTAAAGGTGTTAACGGTGGCTACTGAGAAGAACCATGGCTTGGAAAGATTCCTCCGCTCGGCCAGGGTATACAATATTAATGTGGAAGTTCTCGGTGAAGGCAAGAAGTGGGAGGGAGGGGATATGAAGCATGAGGGCGGTGGTCACAAAGTTAACCTGCTCAAAGATAAGCTGAGTTCAATGAAAATACCTGAAGATAGAGATCAGATTATTCTATTCACTGACAGCTACGATGTGATGTTCCTGGGATCGCTCGATGAGATAGTACAGAAGTTTCTCGCAATGTCCGTACGCGTGTTGTTCTCTGCGGAACCTTTCTGTTGGCCAGATTCCTCTCTCGCGTCGCAGTATCCCGACAGCCAGCAATTGAACCCGTTCCTTAATTCGGGCGGATTTATAGGATATTTACCGGAACTTTTGAAGATCTTGAACTATGAGACAGTGGGCAATAAAGATGACGATCAGCTCTTCTACACCAAAGTGTACTTGGATGAGGATTATAGAGAAAGTCTAAGAATTTCCCTTGACCACAAATCGGCCATATTCCAAAACCTCCATGGTGCCTTGTCGGATGTGCAACTCGTCGCTAACTCCACTGATGAATGGCCGTATCTTGTCAACGTGGTGACCAAGCAGAGGCCTCTGATCGTTCACGGAAACGGGCCCGCAAAATTGACCTTGAACAATTTGTCCAACTATTTGGCCAAGTCCTGGTCTGTCAGTGAGGGATGCGTTCTGTGCGATGAGAAGAGGATTGTGCTGGATGAGGACAAGCTGCCGAAGGTGATGCTCTCCGTATTCATAGAAGTCGCGACACCGTTTATAGAGGAATTTTTCCAAAGTATTCTGGCCATTGATTATCCCAAGCAGAAAATACATCTCTTTATCCGCAACGGTGTCGAGTATCATGAGTCGGAGGTGGAGAATTTCTATCAGGCTCACAGTAGCGAATATTTTACCGCCAAACGGATCAAATCCACTGACCTTGTGGGGGAGGCTGAAGCGAGGAACATTGCTAAGGACCGCTGTATCGGCAGCGACTGTGATTATCTCTTCTGCCTGGACAGCCACGCCCGTGTTGAACCTGATACACTTCATTACTTGCTCTCTACCGGATATGACGTCGTCGCCCCCTTACTAGTACGCAGTGGACAAGCTTGGTCAAACTTTTGGGGTGCTATTAACTCTGTTGGTTTCTATTCCCGTTCAGCTGATTATATGGATATTGTCAACCGCAGCATTGAAGGTATCTGGAACGTCCCGTTCATCAACAACTGCTACCTTATGAACATTTCCCTGTTCCGCAAACCGTCTGCCAAACATGTTAGCTATTTGAAAGAGGACACCGACCCTGATATGGCTTTCTGCGCTTCACTCAGATCTGCTGGTATCATGATGTACGTGAGCAATGAAAAGGAATTCGGTCATCTCGTTAATTCTGAAACGTTTGACGTGAGCCGCACTAACCCTGACATTTACCAAGTGATTGATAACAAGCTTGATTGGGAACAACGTTACCTCCACCCCAAGTACCATGAAATCTTCGCCAACAAAGAAAAGCAACTCATGCCCTGCCCCGACGTCTATTGGTTCCCACTGATGTCGATGCGCTTCTGTAAGGAATGGATCGAAGTCATGGAGGCCTTTGGACAATGGAGCGATGGATCTAACAATGACAAACGTCTAGAGAGTGGTTACGAAGCTGTTCCAACTCGTGATATTCACATGAACCAGGTCGGACTTGACATTCAATGGCTCCGAATCCTCAAGGATTACGTTCGTCCGCTGCAGGAGTTAGTTTTCACTGGATACTACCATAACCCCCCCGTGTCCGTCATGAACTTCGTGGTCCGTTATCGTCCTGATGAACAGCCCTCTCTACGGCCGCACCACGACTCCTCAACTTACACCATCAACCTGGCTCTGAATACTCCCCACTTGGATTACGAGGGTGGTGGTTGTCGGTTCATCCGCTACAACTGTTCGGTGAAGGACACCAAGCCCGGTTGGCTTTTGATGCACCCTGGCCGTCTGACCCACTTCCACGAGGGTCTCCTCGTCACCAAGGGCACACGTTACATTATGATCTCATTCGTGGACCCGTAA

Protein sequence:

>DPOGS209672-PA
MTGLLQLFCLFLLFTSKTAGTTKSKSQQPDVKVLTVATEKNHGLERFLRSARVYNINVEVLGEGKKWEGGDMKHEGGGHKVNLLKDKLSSMKIPEDRDQIILFTDSYDVMFLGSLDEIVQKFLAMSVRVLFSAEPFCWPDSSLASQYPDSQQLNPFLNSGGFIGYLPELLKILNYETVGNKDDDQLFYTKVYLDEDYRESLRISLDHKSAIFQNLHGALSDVQLVANSTDEWPYLVNVVTKQRPLIVHGNGPAKLTLNNLSNYLAKSWSVSEGCVLCDEKRIVLDEDKLPKVMLSVFIEVATPFIEEFFQSILAIDYPKQKIHLFIRNGVEYHESEVENFYQAHSSEYFTAKRIKSTDLVGEAEARNIAKDRCIGSDCDYLFCLDSHARVEPDTLHYLLSTGYDVVAPLLVRSGQAWSNFWGAINSVGFYSRSADYMDIVNRSIEGIWNVPFINNCYLMNISLFRKPSAKHVSYLKEDTDPDMAFCASLRSAGIMMYVSNEKEFGHLVNSETFDVSRTNPDIYQVIDNKLDWEQRYLHPKYHEIFANKEKQLMPCPDVYWFPLMSMRFCKEWIEVMEAFGQWSDGSNNDKRLESGYEAVPTRDIHMNQVGLDIQWLRILKDYVRPLQELVFTGYYHNPPVSVMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPHLDYEGGGCRFIRYNCSVKDTKPGWLLMHPGRLTHFHEGLLVTKGTRYIMISFVDP-