Monarch geneset OGS2.0

DPOGS213103
TranscriptDPOGS213103-TA945 bp
ProteinDPOGS213103-PA314 aa
Genomic positionDPSCF300016 + 96756-99340
RNAseq coverage659x (Rank: top 19%)
Annotation
HeliconiusHMEL0065541e-12273.57% 
BombyxBGIBMGA007863-TA4e-15282.28% 
DrosophilaCG33099-PA5e-7745.05% 
EBI UniRef50UniRef50_D2CVI63e-14381.65%Flavone synthase I n=3 Tax=Neoptera RepID=D2CVI6_BOMMO
NCBI RefSeqNP_001164469.15e-14481.65%flavone synthase I [Bombyx mori]
NCBI nr blastpgi|2834839771e-14281.65%flavone synthase I [Bombyx mori]
NCBI nr blastxgi|2834839772e-15381.65%flavone synthase I [Bombyx mori]
Group
Gene OntologyGO:00167061.1e-11oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
GO:00551141.1e-11oxidation-reduction process
GO:00164911.1e-11oxidoreductase activity
KEGG pathway 
InterPro domain[152-264] IPR0051231.1e-11Oxoglutarate/iron-dependent oxygenase
Orthology groupMCL15874 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213103-TA
ATGAAATCAGTGGTTCGACGTATTGGACAACAGCTGTTCACTGCTCTTAGCACTAAAGGTCTCGCCATGCTTGTCAACCACGGCGTCGCAGAAGAGAAGTTGAAAGCGGTTTACGCAGATTTAGATAACTTTTGTGCTCTACCCGAGGGTTGCCAGGCGCAGTATCTTCGCAACCCGATCAGCAACCACGGTTATGTGAAACCTGGTATGGAGCAGTTCGACGAGACCAAGAAGGAGTTACGTCACTCGTTTAATATAACCACTCTGTCGGCGGCGGCCATGCCCGCTCAAGAAGAGGTGCCAGAGTTTACGCAACACGCCGTGCCCCTCGCTCACGATCTCACCAATCTTTCCCGTGTGCTCTTACAGGCTCTCGGTTACGCTTTTGGTCTGCCCCCCTCCACTCTGCTCTCCTGCCACTCCCACATGCTACAGAGCGATGGCTGCAATGCCTCCACAATGCGTATCCTGTACTACCCGCCCGTACCCCCCGAAGATGAAGGTCCATGCTACGAACACGTTACGTATACGAGATGCGGTGCACACTCCGACAGGTGCACCTTCACGCTCGTCGCTCAGGACTCGGAAGGGGGGCTTGAGGTTAAGTTGAATGGCAGTGATAAGTGGCAATCTGTTGGTCATCTGCCAGGAGCGATTCTCGTACAAACTGGAGAACTTCTTGCTTCTTGGACTACTAACCTGCTACCGGCCCTGATGCACCGCGTCGTTGTACCGTCCGGCACATACGCTCGCGCCCGCGGCCGTCACTGCGTCGCCTTCTTCTGTCACCCGGACAATGAAGCGATCATTCCCCCCCTCGCCCTCCGCCCCGCGCCCGTCCCCGCTCCCCCCGCCTTCACCCCGCACACGCACCTCACCTTACACCACCGGCTGCTGAACGCGGCCCACCACATACAGAAAAGATTCAGAGAAACGTACGCGTGA

Protein sequence:

>DPOGS213103-PA
MKSVVRRIGQQLFTALSTKGLAMLVNHGVAEEKLKAVYADLDNFCALPEGCQAQYLRNPISNHGYVKPGMEQFDETKKELRHSFNITTLSAAAMPAQEEVPEFTQHAVPLAHDLTNLSRVLLQALGYAFGLPPSTLLSCHSHMLQSDGCNASTMRILYYPPVPPEDEGPCYEHVTYTRCGAHSDRCTFTLVAQDSEGGLEVKLNGSDKWQSVGHLPGAILVQTGELLASWTTNLLPALMHRVVVPSGTYARARGRHCVAFFCHPDNEAIIPPLALRPAPVPAPPAFTPHTHLTLHHRLLNAAHHIQKRFRETYA-