Monarch geneset OGS2.0

DPOGS212591
TranscriptDPOGS212591-TA1515 bp
ProteinDPOGS212591-PA504 aa
Genomic positionDPSCF300245 - 339631-344303
RNAseq coverage241x (Rank: top 43%)
Annotation
HeliconiusHMEL0060250.057.49% 
BombyxBGIBMGA005193-TA3e-12057.76% 
DrosophilaninaB-PA8e-9845.64% 
EBI UniRef50UniRef50_A8Y9I29e-17857.06%Neither inactivation nor afterpotential B n=2 Tax=Obtectomera RepID=A8Y9I2_GALME
NCBI RefSeqXP_967460.17e-13246.71%PREDICTED: similar to beta-carotene dioxygenase [Tribolium castaneum]
NCBI nr blastpgi|1603579153e-17757.06%neither inactivation nor afterpotential B [Galleria mellonella]
NCBI nr blastxgi|1603579153e-17457.06%neither inactivation nor afterpotential B [Galleria mellonella]
Group
KEGG pathway 
InterPro domain[2-503] IPR0042948.9e-185Carotenoid oxygenase
Orthology groupMCL10623 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212591-TA
ATGTCTGACACGGAAAGATTATACGATTACAATATGAATATATGGCTGAGGAATTGTGAGGAGGAGATACAGGAACCGATACGTGGAGAAGTGACGGGTAAACTTCCCTCATGGCTCCAAGGTACTCTCGTCAGGAATGGTCCGGGACGAAATAAAATCGGTGATTGTGAATATGAACACGTCTTTGACGGACTGGCTTTGTTACACAGGTTTGATATATCGGAGGGTCAAGTTACTTATCAATGTAAATTCGTGAAGTCTGAGACATACAGAAAGAATATGGCAGCCAATCGCATCGTGGTCACAGAATTCGGCACGACGGCTGTACCGGACCCCTGTCGCAGTATTTTTGACCGTATATCCTCAACGTTCAGATTGAATCGAGATAGAACAGATAACACGGCGGTATCTATTTATCCCTTTGGGGATCAAATTTATGCGATGACCGAAGTACCCGTTATTTACGAAATAGATCCCAGAAACTTAGAAACTGTTGGCAAGAAAGAAATAGAGAAGTCTTTAATAGTATGTCACACCGCTCACCCACATGTGATGCCAAATGGAGACGTTTACAATCTAGGAATGAGTAAAACAAATGGTCTTAAACACGTAGTCGTTAAGTTTACATACAGTGATAAAGGAGATATGTTTGAATCAGCGGAAATAGTCGCCAGCATGACTCCCAGGTGGAAACTCCATCCGTCATACATGCACTCATTTGGCATAACGGAAAACTACTTCGTGATAATTGAACATCCGTTATCGTTGTCTTTATTGGGTATGGCTCGGAGAGTATTTATCACCGCCCCGTTTTCCTCCGTCCTGCACAGCTATCCAGATCAAGACACTCAAATAGTCCTTATCAACAGAATAACCGGCGAGGAGACCAGATACACAACAGACACCATATTTTATATGCACATCATCAATTGCTTTGAATCGGAAGGGAAAGTTATTATTGATTTGTGCAGTTACAAAGACGGAAAAATCATTGAATCCATGTATACACGAGCTATAAAATCAATGCAATCTAATCCGGAGTACGGTGAATGGTGCGAGTGTCGTGCAAAACGCCTCGAAATACCTCTGGACGCGGTCGACTGTAAGGTGGAAGCCAAGTTGATCGCTGATGTTGGATGCGAGGCTCCGAGAATCAATTACGATGTTTGTAATGCTAAACCGTACAGATATTTCTATGGCATCGGTTCCGACATTGGTAGAAGTGACTCTGGCAATCTCGTGAAAGTGGATACAAAAACCGGTGATTACAAAATATGGTTAGAAGAAGACACCTATCCAAGTGAACCGATTTTCGTGCCTCGTCCGGAGGCTGTGGATGAGGATGACGGTGTGCTCTTAAGCGCTCTCGTATGGGGAAGGGATGATCACGCGATAGCTTTACTAGTTCTGGATGCTCGCGATTTGAAAGAAATAGCACGTGTTTGCTTCAAAACCCCCTCGCAGGCAACCAGGTGCTTCCATGGGTGGTTCCTACCCGGCCAACAACTTAAATAG

Protein sequence:

>DPOGS212591-PA
MSDTERLYDYNMNIWLRNCEEEIQEPIRGEVTGKLPSWLQGTLVRNGPGRNKIGDCEYEHVFDGLALLHRFDISEGQVTYQCKFVKSETYRKNMAANRIVVTEFGTTAVPDPCRSIFDRISSTFRLNRDRTDNTAVSIYPFGDQIYAMTEVPVIYEIDPRNLETVGKKEIEKSLIVCHTAHPHVMPNGDVYNLGMSKTNGLKHVVVKFTYSDKGDMFESAEIVASMTPRWKLHPSYMHSFGITENYFVIIEHPLSLSLLGMARRVFITAPFSSVLHSYPDQDTQIVLINRITGEETRYTTDTIFYMHIINCFESEGKVIIDLCSYKDGKIIESMYTRAIKSMQSNPEYGEWCECRAKRLEIPLDAVDCKVEAKLIADVGCEAPRINYDVCNAKPYRYFYGIGSDIGRSDSGNLVKVDTKTGDYKIWLEEDTYPSEPIFVPRPEAVDEDDGVLLSALVWGRDDHAIALLVLDARDLKEIARVCFKTPSQATRCFHGWFLPGQQLK-