Monarch geneset OGS2.0

DPOGS212592
Transcript	DPOGS212592-TA	1464 bp
Protein	DPOGS212592-PA	487 aa
Genomic position	DPSCF300245 - 317475-321668
RNAseq coverage	103x (Rank: top 60%)
Annotation
Heliconius	HMEL006025	0.0	61.48%
Bombyx	BGIBMGA005193-TA	3e-109	51.84%
Drosophila	ninaB-PA	8e-87	43.34%
EBI UniRef50	UniRef50_A8Y9I2	1e-166	55.40%	Neither inactivation nor afterpotential B n=2 Tax=Obtectomera RepID=A8Y9I2_GALME
NCBI RefSeq	XP_967460.1	4e-121	43.25%	PREDICTED: similar to beta-carotene dioxygenase [Tribolium castaneum]
NCBI nr blastp	gi|160357915	5e-166	55.40%	neither inactivation nor afterpotential B [Galleria mellonella]
NCBI nr blastx	gi|160357915	4e-163	55.40%	neither inactivation nor afterpotential B [Galleria mellonella]
Group
KEGG pathway 
InterPro domain	[15-485] IPR004294	1.1e-161	Carotenoid oxygenase
Orthology group	MCL10623	Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212592-TA
ATGCCCAAAACTAAAATAGAAAATTTTGCGACTCAGTCATGGACAACGCCGATGTTAAGGACGTGTACGCAAGAAATTACAAATCCTTTAGAGGGTAAAGTTTCAGGGAGTATTCCGTCTTGGCTTCAAGGAACGCTGATAAGAAATGGATCTGGTGCCATCAAAATTGGCAATTCTGAGTTTGGTCATGTCTTTGACGGATCATCGCTGCTTCACAGATTTTCGATTAAGGGTGGTCGGGCAACATATCAATGTAGGTTCCTAGATTCAAAAACATTAAGAAAAAACAGAGCTGCTAATCGTATTGTGGTCACAGAATTTGCCACGGAAGCTGTTCCAGATCCTTGCCACACAATATTTGATAGAATATCAACCATTTTCAACCCTGCAGAGGCAGTCACTGATAATTGTGGAATATCTGTATATCCTTTCGGAGATCAGATGTATGCCATGACAGAATATACTAATGTTTACAAAATTGACACTGAATCTCTCGATACATTAGAAAAGAAGAGTCTTTTAAATTCATTAATCGTCTGCCACACAGCGCACCCACATGTAATGAAAAATGGAGACGTTTACAATATAGCTCTTGGCGTGGTTAAGGGGCTATTAAAACACATGATTGTCAAATTTCCTTACACAGAAAAAGGTGACATGTTTGATCTCGTAGAAGTTGTTCACACTGTGGACTCCAGGTGGCTCTTACATCCTTCATATATGCATTCTTTTGTGAGCTATGTGAGCAATCGGCCATTTTCATCGAGTGTGTTATGGTATGACGGTTACGAGACACAGATAGTGTTAGTAAATCGCAACACCGGAGAGCAGACCCGTTACACTACAGAGACTTTTTTCTTTATGCACATTATTAACTGTTTTGAATTAGACGGCCAATTGATCATAGATGTGTGTTCTTATAAAGATGCAAAAATTATTGACGCTCTCTATGTGGAAGCAATTAAGAACATACGCAGCAACCCTGATTACGCGAAATGGTGTCAATCCCAACCCAAAAGAATCGAAATTCCTTTAAACAGCCCAAATAACAGTAGGGTGGAGATCTCAGTTATAGCAGATATTCCAATAGAGACACCGAGGATAAACTATGAATTGTACAATGGACGACCCTATCGATACTTTTACGGAATGGGCCCACAGGTGCACTCAATTTATGGCGGAACGATAATTAAAGTTGATACTAAAAGCGGTGAAGTGAAGACATGGTGTGAAGTGGACGCAAATCCGAGCGAGCCTGTATTTGTTGCCCGACCGGATGCTCAGGATGAAGATGATGGTGTACTATTGAGTGCCTTACTTTGGGGCAGTGACGAGAATGCAACAGCACTCCTCGTGTTGGACGCCCGAGATCTCACAGAACTCGGACGTGTTAGGTTTACAACACCCTCGCAGGCACCCAAGTGCTTTCACGGGTGGTTTCTACCCGACACGAATTCAACTTGA

Protein sequence:

>DPOGS212592-PA
MPKTKIENFATQSWTTPMLRTCTQEITNPLEGKVSGSIPSWLQGTLIRNGSGAIKIGNSEFGHVFDGSSLLHRFSIKGGRATYQCRFLDSKTLRKNRAANRIVVTEFATEAVPDPCHTIFDRISTIFNPAEAVTDNCGISVYPFGDQMYAMTEYTNVYKIDTESLDTLEKKSLLNSLIVCHTAHPHVMKNGDVYNIALGVVKGLLKHMIVKFPYTEKGDMFDLVEVVHTVDSRWLLHPSYMHSFVSYVSNRPFSSSVLWYDGYETQIVLVNRNTGEQTRYTTETFFFMHIINCFELDGQLIIDVCSYKDAKIIDALYVEAIKNIRSNPDYAKWCQSQPKRIEIPLNSPNNSRVEISVIADIPIETPRINYELYNGRPYRYFYGMGPQVHSIYGGTIIKVDTKSGEVKTWCEVDANPSEPVFVARPDAQDEDDGVLLSALLWGSDENATALLVLDARDLTELGRVRFTTPSQAPKCFHGWFLPDTNST-
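
The transcript and protein lengths in this record can be cross-checked with simple arithmetic: a 1464 bp coding sequence holds 488 codons, which is 487 amino acids plus one stop codon (the trailing "-" in the protein sequence). The Python sketch below illustrates that check. The function names are hypothetical and not part of any database tooling, and it assumes the transcript is a complete coding sequence with no UTRs, which the ATG start and TGA end of the nucleotide sequence suggest but the record does not state.

```python
# Hypothetical helpers for a length sanity check; not part of any database API.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def expected_protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Return the amino-acid count encoded by a CDS of cds_bp nucleotides.

    Assumes the sequence is a complete coding sequence (no UTRs).
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon is counted as a codon but does not encode an amino acid.
    return codons - 1 if has_stop else codons


def looks_like_complete_cds(first_codon: str, last_codon: str) -> bool:
    """True if the sequence starts with ATG and ends with a stop codon."""
    return first_codon.upper() == "ATG" and last_codon.upper() in STOP_CODONS


# Values taken from the record: 1464 bp transcript, starts ATG, ends TGA.
protein_aa = expected_protein_length(1464)
complete = looks_like_complete_cds("ATG", "TGA")
```

Under these assumptions, `protein_aa` matches the 487 aa listed for DPOGS212592-PA.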