Monarch geneset OGS2.0

DPOGS212590
Transcript        DPOGS212590-TA   1689 bp
Protein           DPOGS212590-PA   562 aa
Genomic position  DPSCF300245 - 348473-355260
RNAseq coverage   30x (Rank: top 76%)
Annotation
Heliconius        HMEL006025         8e-165   51.29%
Bombyx            BGIBMGA005193-TA   3e-154   71.10%
Drosophila        ninaB-PA           8e-130   42.50%
EBI UniRef50      UniRef50_A8Y9I2    0.0      72.24%   Neither inactivation nor afterpotential B n=2 Tax=Obtectomera RepID=A8Y9I2_GALME
NCBI RefSeq       XP_967460.1        1e-156   52.98%   PREDICTED: similar to beta-carotene dioxygenase [Tribolium castaneum]
NCBI nr blastp    gi|384402276       0.0      71.37%   predicted carotinoid oxygenase [Bombyx mori]
NCBI nr blastx    gi|384402276       0.0      72.17%   predicted carotinoid oxygenase [Bombyx mori]
Group
KEGG pathway      tgu:100228904      2e-84
                  K00515 (BCMO1, BCDO1) maps -> Retinol metabolism
InterPro domain   [5-508] IPR004294  2.6e-199          Carotenoid oxygenase
Orthology group   MCL10623           Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212590-TA
ATGGAACAGTGTAAAGAAAAACTTTATCCAAACTGTGACAACCAAGTCTGGCTGCGGTCTTGTGAAGTTGAAGTAAATGAGCCGATAACTGGCGAAATTACAGGAGAATTTCCAAGATGGGTCCGAGGAAGCCTACTCCGTAACGGTCCCGGGTCTTTAAAAGTTGGCTCAATGAGATTCAAACACCTGTTTGATAGTTCTGCATTACTTCACAGGTTCAATATACAGGATGGTCAAGTGACCTATCAATGTCGCTTCCTTCGGTGTAACACCTACAATAGAAACATGGCCGCGGGTCGTATAGTTGTCACTGACTTCGGCACTCAGTCCGTACCAGACCCTTGTCATTCGATTTTCGACAGATTTGCTACGCTATTTTCTCCAGGCGAAGCTTTGTCAGACAACGCTATGATTTCAGTTTACCCTTTCGGGGACGAAATGTATGCGTTCACAGAAGGACCAGTCATTCATAGAATAAATCCGGTGACGTTGGACACTATGGAGCGAAAGAATCTTATGGATAGTGTCGCTCTTGTGAACCACACCTCCCACCCTCATGTTATGTCTAATGGTGACGTGTACAATGTCGGCATGTCTCTTGTTAAAGGCCGACTTCGACATGTCGTTGTTAAGTTTCCATTTATGGAAAAAGGAGATATGTTTCAAAACGCAACCATAGTAGGCAGTATGAAACCTCGCTGGACATTACACCCGGCGTACATGCACACATTTGGGATAACGGCCAATTACTTCGTCATAATTGAACAGCCCCTGTCCGTGTCAATATGTAGGCTGGTACACAATCAGCTCAGGAACCGGCCTCTAGCCTCCAGCCTGAAGTGGTACCCGGAACATAAGACGAATATCGTACTAATAAATCGCGAAACTGGTAAAGAGGTGAAGAGGTATGAGACGGAGACACTATTTTTCCTCCACATCATTAACTGTTACGAGATTGCTAACAAACTGATAGTAGATCTGTGCTCGTACAAGGACGCCAAAATATTGGACGCCATGTACATCCAAGCTATTGAGACAATGCAAACTAACGCAAATTACGCGGATTGGTTTCGAGGGAGACCTAAGAGAATTGAAATAGACCTGAATGCGCCCATTCTGACGCACTTCAAGCCGAGGTTGTTGGCCGATTTGGGCTGCGAGACACCCAGGATTCATTACGATGTTTATAATGGTAGACCTTACAGATACTTTTACGCTATCAGCTCTGACGTCGACGCTGAGAATCCTGGATTGATAATAAAAGTCGACACAGTCACCGGTCAAACTCAGACCTGGTTTAATAATAACTGTTATCCGAGCGAGCCCGTCTTCGTGCCTCGTTGCGACGGAAAGTCCGAGGACGACGGTGTTCTTCTAACCGCTCTGGTGAAGGCTGACGACTCTCACTCAGTCTCCCTGGTATGTCTGTGCGCCGTCACCTTAAAGGAGCTCGCTCGGTGCACCTTCACGACACCGTCTCCAACTCCTAAGTGCCTTCATGGGTGGTTTCTACCCGACTCCGAGGACGACGGTGTTCTTCTAACTGCTCTGGTGAAGGCCGACGACTCTCACTCAGTCTCCCTGGTATGTCTGTGCGCCGTCACCTTAAAGGAGCTCGCTCGGTGCACCTTCACGACACCGTCTCCTACTCCTAAGTGCCTCCATGGGTGGTTTCTGCCCGACGTATAA

Protein sequence:

>DPOGS212590-PA
MEQCKEKLYPNCDNQVWLRSCEVEVNEPITGEITGEFPRWVRGSLLRNGPGSLKVGSMRFKHLFDSSALLHRFNIQDGQVTYQCRFLRCNTYNRNMAAGRIVVTDFGTQSVPDPCHSIFDRFATLFSPGEALSDNAMISVYPFGDEMYAFTEGPVIHRINPVTLDTMERKNLMDSVALVNHTSHPHVMSNGDVYNVGMSLVKGRLRHVVVKFPFMEKGDMFQNATIVGSMKPRWTLHPAYMHTFGITANYFVIIEQPLSVSICRLVHNQLRNRPLASSLKWYPEHKTNIVLINRETGKEVKRYETETLFFLHIINCYEIANKLIVDLCSYKDAKILDAMYIQAIETMQTNANYADWFRGRPKRIEIDLNAPILTHFKPRLLADLGCETPRIHYDVYNGRPYRYFYAISSDVDAENPGLIIKVDTVTGQTQTWFNNNCYPSEPVFVPRCDGKSEDDGVLLTALVKADDSHSVSLVCLCAVTLKELARCTFTTPSPTPKCLHGWFLPDSEDDGVLLTALVKADDSHSVSLVCLCAVTLKELARCTFTTPSPTPKCLHGWFLPDV-
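
The lengths listed for this record (1689 bp transcript, 562 aa protein) agree if the transcript is a complete coding sequence with one stop codon: 3 x (562 + 1) = 1689. A minimal Python sketch of that check follows. The helper names `read_fasta` and `lengths_consistent` are hypothetical, and the short sequences are toy data rather than the monarch record; the sketch also assumes the trailing "-" in the protein sequence marks the stop codon.

```python
def read_fasta(text):
    """Parse FASTA text into a dict mapping record ID to sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Record ID is the first whitespace-delimited token after '>'
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def lengths_consistent(cds, protein):
    """True if the CDS length equals 3 x (residues + 1 stop codon)."""
    # Assumption: a trailing '-' or '*' in the protein marks the stop codon.
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Toy records (not the real sequences): ATG GAA CAG TAA -> M E Q stop
toy = """>toy-TA
ATGGAACAGTAA
>toy-PA
MEQ-
"""
seqs = read_fasta(toy)
print(lengths_consistent(seqs["toy-TA"], seqs["toy-PA"]))

# Arithmetic for the lengths stated in this record
print(3 * (562 + 1))
```

To apply it to this entry, paste the two FASTA records above into one string and pass the `DPOGS212590-TA` and `DPOGS212590-PA` sequences to `lengths_consistent`.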