Monarch geneset OGS2.0

DPOGS202254
TranscriptDPOGS202254-TA3975 bp
ProteinDPOGS202254-PA1324 aa
Genomic positionDPSCF300032 - 576401-590984
RNAseq coverage297x (Rank: top 38%)
Annotation
HeliconiusHMEL0050990.079.74% 
BombyxBGIBMGA004906-TA0.071.70% 
DrosophilaCG8129-PB7e-12658.69% 
EBI UniRef50UniRef50_Q210802e-12557.01%Protein K01C8.1 n=5 Tax=Caenorhabditis RepID=Q21080_CAEEL
NCBI RefSeqXP_624902.13e-15162.74%PREDICTED: similar to CG8129-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3407141598e-15262.82%PREDICTED: threonine dehydratase catabolic-like isoform 2 [Bombus terrestris]
NCBI nr blastxgi|3407141591e-14663.33%PREDICTED: threonine dehydratase catabolic-like isoform 2 [Bombus terrestris]
Group
Gene OntologyGO:00081526e-101metabolic process
GO:00038246e-101catalytic activity
GO:00301706e-101pyridoxal phosphate binding
GO:00165973.3e-05amino acid binding
KEGG pathwayame:5525239e-151 
 K01754 (E4.3.1.19, ilvA, tdcB)maps-> Valine, leucine and isoleucine biosynthesis
    Glycine, serine and threonine metabolism
InterPro domain[471-822] IPR0019266e-101Pyridoxal phosphate-dependent enzyme, beta subunit
Orthology groupMCL10849 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202254-TA
ATGGTGTTTCAGACGGTCGAGTTCGACCCCATGTGTGACAAAGACAATCCTCAAATAATAAGTTTTGAGGACGTATCAGCAGCTGCCTATAGAATCCAGAGCGGGATTATAAAAACGCCATGCGTGAAATCTCACATGTCATCTATTTTTGAAATGGATATTTATCTCAAAAATGACTTTCTGCAACACACTGGCAGGCGAGTATTTATAATGAAATCCATAAAAAAAAATAATGGAATTTTCAAAGAACGTGGGGCTCGTAACGCCCTCATTTTGCTATCATCCGAGGCAAAAACTCGCGGTGTGATATCAGCTTCACTCGGAAACCACTCCCAGGGTCTCAGCTACCACGCACAACAGCTGAACATACCGGCCACTGTGGTCATGCCAAATGTGGCACCCATCATGAAAATACAGAACTGCCGTTCGTACGGCGCTAACGTCGTCATCCATGGGCACGACATGAAGGAAGCGAAGTACCATGCTATGACGCTAGCCAAGGAGAGGGGACTCACATATATTAACGGCTACGATCACCCTCACATAATGGCCGGTCAAGGCACCGTGGGGTTGGAGATAGTGGAACAGGTTCCTGATGTAGACGCGGTCATCGTCCCCGTGGGCGGCGGCGGTCTACTCGCCGGAGTCGCCACTGCCATTAAGAATATTAAGCCCCATGTTCTCATATACGGCGCTGAAACCGAAAAATGTCCAAGCATGAAGATGGCTATCAAACATCAACAGCCAGTGAGCGTCAATATCCGGTCCACGCTGGCGGACGGGCTCGCCGTGCCCACGGTCGGCTACAACGCATTCAAAACATCGAAATCGCTCATGGATAGGATGATAACAGTAAACGAGGATTGGATTGCTCGCGCGATTCTGCGTTTGGTGGAACAAGAAAAGTATGTTGTGGAGGGCGGCGGCGCTGTCGGAGTGGCGGCTATCATGGCCGGACTCGTGCCCGAGTTGGTTGGGAAGAAGGTGGTATGTATTCTGTCGGGTGGTAACATCGACACCACAATCCTCGGTCGTTGCTTGGAGCGGGGTTTAGCCGCGGAACAGAGGCTCGTCAAATTCAAGGTCACCGTCAGCGACCGACCCGGCGGCATCGCGGAACTTTGCAAACTCATCTCCTCCATAGGAGTATCTATCAAGGATATAATGCAAGAACGAGCTTGGGTCTTTGGCGACATATTTAGCGTGAAAGTCGTTTGTGAAACCAGAGGTCCGGAACATTTGGAGGAGCTGGAGAAAATGATAACTGACACATATAAAGAGTGGAATTTCTCCAGGGATTGTGAAGAATTTGACAGAAATGATAGAAGACTGAGCACGTTCTCCATCGATGAAACCCAAGACGTTGAATATGACGAATATTGTGATCCGAACAATCCTCGGAAGATTAAATATGACGATATTTTGGCTGCATATAGAAGAATTACGGGTTACGTATTGAAAACGCCTTGTACGAGAGCTCACATGTCAGATAGGTTGGGTATGGAAATATATTTAAAGCAGGAGTTCATGCAACACACTGGATGCTTTAAGGAACGCGGAGTTAGGAATACTATGCTGTTACTGTCGGAGGAGCAAAGGAAAGTTGGTGTAATAAGCGCTTCGACGGGGAACCATGGCACTTCAATGAGTTATCACACCACACAGATGGGTATTCCTTGTATAGTTGTGATGCCGGTTCGAGCACCTATCACTAAACTGACTAAATGTCAAAACTTTGGAGCGAAAACAATACAACATGGCGACAATATGGCCGAAGCGAAACATTACGCTATGGCTCTGTCAAAAGAAAAGAAATTATACTACGTTAATGGTTATGATCACCCAAACGTCATAGAAGGTCAGGGTACTATCGGCATAGAGATTATAGAACAGGTACCGGATGTAGACGCTGTCATTGTACCTGTTGGCGGAGGTAGTCTTTTATGCGGCATAGCTGTTGCCGTGAAACATTTAAAACCGGACACGGAAGTTTATGGTATACAAACAGAAAAAGCTTATAGTATGGTAGAAGCTTTAAAGAGAAATGAAAGGGTGAAAATTGTCATCGACTCTACCATCGCTGACGGTCTAGGAGTAAACTTAGCAGGCGTCAATACTTTTCACAATCTGAAAAGCGGAATATTGGATAAAATGGTAATAGTTAAAGAGGACTGGGTCGCCCGTGCTATAATGCATGTGGTCGAGGAAGAGCGCTACGTCATAGAGGGTGCTGCGGCTGTCACCATAGCGGCCGTTATGGCGGGGCTTTTCCCGAATCTTAAGGGTAAAAAGGTGGTATGCGTGTTGTCTGGTGGTAACATCGACACAACCATCCTGGCTCGGTCGCTGGAGCGCGGTATGGCCGCGGAGGGTAGGTTGGTGAAATTCAAGGTGACGGTGAGCGACCGTCCCGGGGGTATGGCGGAGCTGTGCTCGCTGCTAGCCACCATCGGCGTCACCGTCCGCGACTGTATACCGGAACGAGCCTGGGTCAAGGGAGACGTGTTCAGTGTTGAGATGAAAGTGATAGTTGAGACCAGAGGATGGGATCACACGAAAGAACTGATAGAGCAAATAAAGAAGAAGTACAAGGAATGTTTCTTCCACGAGATGAGCGAACGCAGCGACAAGGGCGCCGGCGCCAAGAGAGGCCCCTGCCTCGCCCCCAACCCGGAAGATTTCGATGAGTTCTGCGATCCTGACAATCCAAAAATAATTAAATATGAAGATGTCGTCGACGCTTTAAAACGTATAAGAAAATATATACCGCAGACTCCAATAATAGCTTCACACTATCAGAAAGAATGCGGCATCAATCTTTTCTACAAACTGGAAACGGTAATGAGAACAGGAAGTTTTAAGGAAAGAGGTGCATTAAACGCGCTAGATTTATTGCCAAGAGATAGACAAAAGATGGGCGTTGTTGTAGCGTCTCTTGGAAACCAGGCAATGGGAATATGTTATTATGGTAAAAAACTAGGGATACCAGTGACTGTGGTGATGCCAACCTCTGTGCCAGTCATAAAACTACAAATGTGCAGCGACATGGGCGCCAAAGTTGTAGTTCAAGGTCACAATTTGGTGGACTCTCAGAAATATGCTCGAGCCTTAGCAAAAGAGAAAGGTCTCACTCATATTAACGCTCGTGACTATCCCGATGTGATGGCGGGCTATGGTACTGTAGCAATAGAAATTATGGAGCAAGTGCCCGCTTTGGACGCTGTTCTATTGCCCATTGGGACCGGCGGCTTGGCGGCAGGGGTCGCTACTGTCATCAAACACGTCAATCCAAACTGCCTTGTATACGGAGTTCAATCTGAACGACTGCCAACATTTTACAAGTCCCTGGAAGCGGGAGAACCTCTTACTTTGCCCTACGAACCATCAATAGCCGAGGGTATAGCGATGCCGTATGTCGGAGTGAATGCGTTTAGAAACGCACAAAATTTGCTCGACAAATTAATATTAGTTTCTGAGGACTGGATAGCTAGGTCTATTTTGCATTTAATTGAAAAAGAACGGCTAGTTGTAGAGGGCGCTGGTGCTTGTCCGTTGGCAGCAGTTCTATGTAGCCAGGTCCCTGAACTGAAGTCAAAAAATGTCGTTATAATCCTCAGTGGAGGTAACATAGACGCTGTATTACTAGGCCGTTGTTTAGACAGAGGACTGGCAGCAGAAGGTCGTCTTATAAAATTCAAAGTACTCGTGAAAGATTTAGGCACAGAGTATGAAAGGTTTACTAAACTGCTAGCGGATAACGGTTATAATTTGGTAAGACAGTTTCAGGATCGCATTTGGGTGGAAAACGAAATTTACCGAGTCGAGATGAAAGCCGTCTGCGAGACGAGAGGACTCGACCACGCTCTGGAGTTGAAGAGAATAATTGAAAAGACATATCCAAATGAATGCGCTTTTGAAACGGAACCCTTTGATAGCGACAACACCTGCTCTTGCTATATACCTTCAAAATGTTAA

Protein sequence:

>DPOGS202254-PA
MVFQTVEFDPMCDKDNPQIISFEDVSAAAYRIQSGIIKTPCVKSHMSSIFEMDIYLKNDFLQHTGRRVFIMKSIKKNNGIFKERGARNALILLSSEAKTRGVISASLGNHSQGLSYHAQQLNIPATVVMPNVAPIMKIQNCRSYGANVVIHGHDMKEAKYHAMTLAKERGLTYINGYDHPHIMAGQGTVGLEIVEQVPDVDAVIVPVGGGGLLAGVATAIKNIKPHVLIYGAETEKCPSMKMAIKHQQPVSVNIRSTLADGLAVPTVGYNAFKTSKSLMDRMITVNEDWIARAILRLVEQEKYVVEGGGAVGVAAIMAGLVPELVGKKVVCILSGGNIDTTILGRCLERGLAAEQRLVKFKVTVSDRPGGIAELCKLISSIGVSIKDIMQERAWVFGDIFSVKVVCETRGPEHLEELEKMITDTYKEWNFSRDCEEFDRNDRRLSTFSIDETQDVEYDEYCDPNNPRKIKYDDILAAYRRITGYVLKTPCTRAHMSDRLGMEIYLKQEFMQHTGCFKERGVRNTMLLLSEEQRKVGVISASTGNHGTSMSYHTTQMGIPCIVVMPVRAPITKLTKCQNFGAKTIQHGDNMAEAKHYAMALSKEKKLYYVNGYDHPNVIEGQGTIGIEIIEQVPDVDAVIVPVGGGSLLCGIAVAVKHLKPDTEVYGIQTEKAYSMVEALKRNERVKIVIDSTIADGLGVNLAGVNTFHNLKSGILDKMVIVKEDWVARAIMHVVEEERYVIEGAAAVTIAAVMAGLFPNLKGKKVVCVLSGGNIDTTILARSLERGMAAEGRLVKFKVTVSDRPGGMAELCSLLATIGVTVRDCIPERAWVKGDVFSVEMKVIVETRGWDHTKELIEQIKKKYKECFFHEMSERSDKGAGAKRGPCLAPNPEDFDEFCDPDNPKIIKYEDVVDALKRIRKYIPQTPIIASHYQKECGINLFYKLETVMRTGSFKERGALNALDLLPRDRQKMGVVVASLGNQAMGICYYGKKLGIPVTVVMPTSVPVIKLQMCSDMGAKVVVQGHNLVDSQKYARALAKEKGLTHINARDYPDVMAGYGTVAIEIMEQVPALDAVLLPIGTGGLAAGVATVIKHVNPNCLVYGVQSERLPTFYKSLEAGEPLTLPYEPSIAEGIAMPYVGVNAFRNAQNLLDKLILVSEDWIARSILHLIEKERLVVEGAGACPLAAVLCSQVPELKSKNVVIILSGGNIDAVLLGRCLDRGLAAEGRLIKFKVLVKDLGTEYERFTKLLADNGYNLVRQFQDRIWVENEIYRVEMKAVCETRGLDHALELKRIIEKTYPNECAFETEPFDSDNTCSCYIPSKC-