Monarch geneset OGS2.0

DPOGS208048
TranscriptDPOGS208048-TA939 bp
ProteinDPOGS208048-PA312 aa
Genomic positionDPSCF300203 + 270934-272242
RNAseq coverage340x (Rank: top 34%)
Annotation
HeliconiusHMEL0145614e-6948.75% 
BombyxBGIBMGA001476-TA2e-12667.09% 
Drosophila% 
EBI UniRef50UniRef50_E2BPK81e-8252.33%PKHD domain-containing transmembrane protein FLJ22222-like protein n=11 Tax=Endopterygota RepID=E2BPK8_HARSA
NCBI RefSeqXP_001605079.16e-9452.56%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565372211e-9252.56%PREDICTED: PKHD domain-containing transmembrane protein C17orf101 homolog [Nasonia vitripennis]
NCBI nr blastxgi|1565372216e-9052.56%PREDICTED: PKHD domain-containing transmembrane protein C17orf101 homolog [Nasonia vitripennis]
Group
Gene OntologyGO:00167056.4e-12oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055066.4e-12iron ion binding
GO:00551146.4e-12oxidation-reduction process
GO:00314186.4e-12L-ascorbic acid binding
KEGG pathway 
InterPro domain[99-295] IPR0066206.4e-12Prolyl 4-hydroxylase, alpha subunit
Orthology groupMCL16187 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208048-TA
ATGTCGGAAGTAAAGAAAAGAAGCAAACAAAGTATTAGCACCGACAAAATCTTAGAGTCTGAAGAAAAAAAAGCTAAAGAAAAAGCCTCACCAAATAAAAATTTACCTCTAAGGATTCTTTCTCGAACTGTAGTTATATTTTCACTATTGATAATTGTTTATATGTCAGCAAAAGATAGAACAAAAATTTTTGCCAAACAAACGGAAGTAATTCCAGGGAAAGGATTGATTATTGAGTGTTCCTCTGAATATATGAAAGAATTAGATGAATATGAAGGCTGTGCACCAAAAAATTGTAAAAGATATGTTACCGATACTGTTATATCTACAAAAGAAGCTGACGAGCTTCTGAATATGGCGAAGAGAGGTTTAAAACATGGAGGTTCTCTAGGAGGGGCATCAATATTAGACCTACATAGTGGTGCCTTATCGAAGGGCTCAAATTTTGTAAATTTCTATAAATTGAAAGAAATGAAAAACATCTTTGATCAGAATGATTTTAACACTTTTAGGGTGGTGAAGGACAAAATTAAATATGCTATTGCTCATCACTTTGGAGTTCAGCCAACTAAAATATATTTGACATATCCCACATTCTTTTCGGAGATAAGCACAAAGAAAGCTGTTACAATCCACGATGAATACTGGCATCCCCATGTTGATAAAGAGTCCTACAAGTCATTCCATTACACAACGCTGCTCTATTTGGGCGATTACAATATAGACTTTAAAGGTGGCCAGTTTGTTTTCATTGATAGCAACTACAACTATACAGTTGAACCTCGGAAGGGCAGATTAAGCATGTTTACCAGTGGTGCTGAGAACTTACACCATGTTCAGAAAGTTACAGCCGGAGTGAGATATGCAATGACTATATCGTTTACATGTAACAAAAACTATGCAATAGCAGATCCCGGTGTTGAAAAATATATTTCATAG

Protein sequence:

>DPOGS208048-PA
MSEVKKRSKQSISTDKILESEEKKAKEKASPNKNLPLRILSRTVVIFSLLIIVYMSAKDRTKIFAKQTEVIPGKGLIIECSSEYMKELDEYEGCAPKNCKRYVTDTVISTKEADELLNMAKRGLKHGGSLGGASILDLHSGALSKGSNFVNFYKLKEMKNIFDQNDFNTFRVVKDKIKYAIAHHFGVQPTKIYLTYPTFFSEISTKKAVTIHDEYWHPHVDKESYKSFHYTTLLYLGDYNIDFKGGQFVFIDSNYNYTVEPRKGRLSMFTSGAENLHHVQKVTAGVRYAMTISFTCNKNYAIADPGVEKYIS-