Monarch geneset OGS2.0

DPOGS213793
Transcript: DPOGS213793-TA (1242 bp)
Protein: DPOGS213793-PA (413 aa)
Genomic position: DPSCF300212 + 865920-873884
RNAseq coverage: 699x (Rank: top 18%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL010716        4e-70    86.52%
Bombyx          BGIBMGA009276-TA  0.0      84.44%
Drosophila      v-PA              3e-159   71.73%
EBI UniRef50    UniRef50_O77457   9e-171   74.55%    Tryptophan 2,3-dioxygenase n=4 Tax=Metazoa RepID=T23O_ANOGA
NCBI RefSeq     XP_312204.2       2e-171   74.55%    AGAP002721-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|38570373       0.0      88.86%    tryptophan oxygenase [Plodia interpunctella]
NCBI nr blastx  gi|38570373       0.0      88.86%    tryptophan oxygenase [Plodia interpunctella]
Group
Gene Ontology    GO:0005506  8.6e-225  iron ion binding
                 GO:0055114  8.6e-225  oxidation-reduction process
                 GO:0019441  8.6e-225  tryptophan catabolic process to kynurenine
                 GO:0004833  8.6e-225  tryptophan 2,3-dioxygenase activity
                 GO:0016491  8.6e-225  oxidoreductase activity
KEGG pathway     aga:AgaP_AGAP002721  5e-171
                 K00453 (E1.13.11.11, TDO2) maps-> Tryptophan metabolism
InterPro domain  [3-378] IPR004981  8.6e-225  Tryptophan 2,3-dioxygenase
Orthology group  MCL12016  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213793-TA
ATGGCCTGTCCTATGCGGTCAGCGTACGATGACAGCCAAGATGGCCCACAGCTGGGAAACGAGGCTGGGATGCTTTATGGAGAGTACCTGATGCTGGACAAACTCCTGAGCGCTCAGAGAATGCTCAGCGCTGAGTCCTCCAAGCCAGTACATGATGAACATCTGTTTATAGTAACACATCAAGCGTACGAGCTGTGGTTCAAACAGATTATATTCGAGGTCGATTCAGTACGAGCATTGCTGGATGTAGAAGGTCTAGATGAAAGTCACACTATGGAGATCTTAAAGCGGCTGAATAGAGTAGTGCTGATACTTAAGCTGCTTGTAGACCAAGTAATGATACTTGAGACAATGACGCCGCTTGATTTCATGGACTTCAGGAATTACTTGCGACCGGCGTCCGGCTTCCAAAGCTTGCAATTCAGACTTCTAGAAAACAAGCTTGGACTCAAGCAGGCCCTGCGTGTGAAATACAATCAAAATTACCAAACAGTTTTCGGAGATGACCCCGAGGCTATTAAATCTCTGCACAAATCCGAAGAGGAGCCGGCTCTGCTCGCGTTGATCGAGCGCTGGTTGGAGCGAACACCTGGGCTAAACGCACATGGATTCAACTTCTGGGGCAAATTCCAGGCGGCTGTCAACAACATGATACAAGAGGATATCGAAGCAGCTATGAGCGAACCAAATGAGATTGTCAGGAACCATAGACTGAGGGATGCGGAGAATAGACGAGAGACTTACCGCTCCATCTTCGACGCGGAAGTTCACAACGCACTGAGATCCCGGGGAGAGAGAAGGTTGTCCCACAAGGCGTTGCAGGGCGCTATCATGATAACGTTCTACAGGGACGAGCCGCGTTTCTCTCAGCCTCACCAACTTCTGATGCTGCTTATGGACATCGACAGTCTCATCACCAAATGGAGATATAACCACGTGATCATGGTTCAGCGCATGATTGGCTCGCAGCAGCTAGGAACTGGCGGCTCGTCAGGGTACCAGTACCTGAGATCTACGCTCAGTGACCGCTACAAAGTATTCCTGGATCTTTTTAATCTGTCCACGTTCCTCCTCCCGCGTTCCCTGATCCCCCCTCTGGATGACGGGATGAAGAAAGATCTGAACCTCATGTGGGGAGATCTCAAGGAAATGGGGGAAAATGGTGAGAACCAATTGAACGGTGAAAATGGTCACCCTTTGGAGCAATCAATCTCGAATTTAACACTCAAAGATAAATCCTGA

Protein sequence:

>DPOGS213793-PA
MACPMRSAYDDSQDGPQLGNEAGMLYGEYLMLDKLLSAQRMLSAESSKPVHDEHLFIVTHQAYELWFKQIIFEVDSVRALLDVEGLDESHTMEILKRLNRVVLILKLLVDQVMILETMTPLDFMDFRNYLRPASGFQSLQFRLLENKLGLKQALRVKYNQNYQTVFGDDPEAIKSLHKSEEEPALLALIERWLERTPGLNAHGFNFWGKFQAAVNNMIQEDIEAAMSEPNEIVRNHRLRDAENRRETYRSIFDAEVHNALRSRGERRLSHKALQGAIMITFYRDEPRFSQPHQLLMLLMDIDSLITKWRYNHVIMVQRMIGSQQLGTGGSSGYQYLRSTLSDRYKVFLDLFNLSTFLLPRSLIPPLDDGMKKDLNLMWGDLKEMGENGENQLNGENGHPLEQSISNLTLKDKS-
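
The transcript and protein lengths above are consistent with each other: 1242 bp is 414 codons, that is, 413 residues plus one stop codon. The following is a minimal sketch for checking that relationship and for translating the coding sequence. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The function name `translate` and the `stop_symbol` parameter are hypothetical, and the `-` stop symbol simply follows the convention used at the end of the protein sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check: 413 residues plus one stop codon, three bases each.
transcript_bp = 1242
protein_aa = 413
lengths_agree = transcript_bp == 3 * (protein_aa + 1)

# First 18 and last 12 bases of DPOGS213793-TA, copied from the record above.
first_residues = translate("ATGGCCTGTCCTATGCGG")
last_residues = translate("GATAAATCCTGA")
```

Passing the full DPOGS213793-TA sequence to `translate` in the same way would allow a comparison against the complete DPOGS213793-PA entry.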