Monarch geneset OGS2.0

DPOGS205971
Transcript: DPOGS205971-TA; 4722 bp
Protein: DPOGS205971-PA; 1573 aa
Genomic position: DPSCF300164 - 217021-269649
RNAseq coverage: 671x (Rank: top 19%)
Annotation
Heliconius: HMEL007548; 0.0; 74.07%
Bombyx: BGIBMGA009401-TA; 0.0; 69.91%
Drosophila: (no entry)
EBI UniRef50: UniRef50_D2A1L6; 5e-163; 43.25%; Putative uncharacterized protein GLEAN_08410 n=3 Tax=Eukaryota RepID=D2A1L6_TRICA
NCBI RefSeq: XP_969974.2; 1e-163; 43.25%; PREDICTED: similar to Y71H2B.5 [Tribolium castaneum]
NCBI nr blastp: gi|383853521; 1e-162; 39.06%; PREDICTED: uncharacterized protein LOC100879481 [Megachile rotundata]
NCBI nr blastx: gi|307169701; 7e-175; 36.91%; tRNA 2-thiocytidine biosynthesis protein ttcA [Camponotus floridanus]
Group: (no entry)
Gene Ontology: GO:0003824; 6.1e-18; catalytic activity
Gene Ontology: GO:0030170; 6.1e-18; pyridoxal phosphate binding
KEGG pathway: (no entry)
InterPro domain: [79-286]; IPR015421; 6.1e-18; Pyridoxal phosphate-dependent transferase, major region, subdomain 1
InterPro domain: [356-671]; IPR015424; 2.4e-17; Pyridoxal phosphate-dependent transferase, major domain
InterPro domain: [641-670]; IPR015422; 1.2e-08; Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology group: MCL16992; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205971-TA
ATGACAAGTGTTTTGAAAAAGTCAAGAGGCACTAAAGGTAAATCTGAATCGATGCAAGAGGACTTGCCGTCTAGATGGATCGCCAGAGGATTGAAGGAAGATGCCGGGAAACTCATAAAATACATCGATGATAATGTTATCGGAAAAGAAAATTCGTTCTGTGGACCCTTCGGAAGGAGAAAAGTAGTGTACTGCGACTACAGCGGCTCGGGTCGATCACTGCAGTGGCTCGAGGATTATGTCACGCGGGACGTCCTGCCACGTCATGCCACACGCTGCTCAGCTCTTTCCGTGACATCGGGGCAGACTGAGGTCTATAGGTCAGAATCGAAGGAGATTTTACGAACAGCTGTAGGGGCTGGTGATGAAGATGTAGTTGTGATTGGAGCATCTCTGACGCAGTTCCTACGGGCTCTGAGGCCGGAGAATGTAGTCGTGTTTGTCTCTTCACGTGAAACGGAGAGACAGCTCACACCGTGGAAGGAGTTTGGGGCTGAGATAATCAAGATACCGGAAACTAAAGAAGGTTTTATTGATCTGAACAACTTAGAGATGAGGTTACAGGAGAACACGGAGGACGGAAAACGTATGATAGGATTCTTTCCAGCCGCTTCCAAACTCACTGGAGTTTTAGCTGATGATGTCGCAACGACGCTGTTGCTTCACCAGTACGGAGCTTGGTCTTTTTGGGACTACACCCTCGTAGCACCGACTTGTGCTGTCGATATGAACCCGACCTTCCTGGGGGTGGAAGACGGTATGGTGAATAAGGATGCTCTGTTTTACAACTGCGAGAAATTCGTCGGGGGCATCCAGGGTCCGTACATCACTGTGGTGAAGAAGCACCTGTTCACAAACACCGCGATATATCATGATGATGTAGAGGTACTGTCTGAGAGGATATCAGAGTACAAGTGCACGGAGGCAGTTCGAGCAGCTATCGTGATGCAGATACGAGACGGAATCGGCCTCCAGAACATAGCTGACAGACAGGATCATATTACTAAGAGAGTAGATCGAAGGCATCATTATATTCTTATGTTCTATAACATTCAAAACAATTCCAAGATAATCAAGATACCGGAAACTAAAGAGGGTTTCATTGATCTGAACAACTTAGAGATGAGGTTACAGGAGAACACGGAGGACGGAAAACGTATGATAGGATTCTTTCCAGCCGCTTCCAAACTCACTGGAGTTTTAGCTGATGATGTCGCAACGACGCTGTTGCTTCACCAGTACGGAGCTTGGTCTTTTTGGGACTACACCCTCGTAGCACCGACTTGTGCTGTCGATATGAACCCGACCTTCCTGGGGGTGGAAGACGGTATGGTGAATAAGGATGCTCTGTTTTACAACTGCGAGAAATTCGTCGGGGGCATCCAGGGTCCGTACATCACTGTGGTGAAGAAGCACCTGTTCACAAACACCGCGATATATCATGATGATGTAGAGGTACTGTCTGAGAGGATATCAGAGTACAAGTGCACGGAGGCAGTTCGAGCAGCTATCGTGATGCAGATACGAGACGGAATCGGTCTCCAGAACATAGCTGACAGACAGGATCATATTACTAAACAAGTCCTCTCGCATATCAAGAACATACCTGAGTTGATTTTATTGGGAAGCGAGTCCCGTACGAGACGGTTGCCAATCTTTTCTTTAATGGTAAAACATCCGAGGGGTATCTTCCTGCATCATAACTTCATCTGTGCTGTTCTCAACGACGTGTTCGGTATCCAGGCCCGAGGGGGACTTAGCGGAGATTTGAATTACGGATCAGATATCCTAGGGATAGATGATCATCTGTTGAAGGAATACGAAAAGTTATTGGATGTTGAAGCTCAAAAAGAAATATCTCGAGTTCGTAAACTAAGTGTGGTTAAATCCCCGGAAGTTCCAATCCACAATGAACCATTACGGCCCGGATTTTGTCGGTTATCACTACCTTTCTTTATGTCTGAGAATGAACTGGCGTTCGTCTTAGAAGCTTTGAAAAT
GGTTGCCACGGAAGGATGGAAGATTTTGCCACAGTATGTAGTGAACTCGGAAACCGGTCAATGGAGACATCATTCGTGCTCAGTACTTAGAGATAAGAAGTCATTGTATTCAATAAGGTTCAACGATGGAAAAATTACAGCGAACGAGAGACGAGTATCAGGTCCAGGAATATTCCCGCAGACTTTCGCGGAATGTTTGCAAACCGCTCGTAATCTTTTCAATCGAGCTAGGAAACTGGCCATGAAATGTGCAACGGTTGAGCCCGAAGTCAGTTTTAACCCGAAAATAGATTACTTGAGATGGTTCATGTTGCCGAAGGAGGCTCACGACCTGTTGTTGGGCAAGTCCGCGAATGTGAAACACATTGTGCCATTCGATCCCGTGGGCTACACGGGGACGAGAAAGAGCCTGAACAATTCGAGGTCATCTCACACCTCCTCCCCCGTCCTGGGGACTACCTCCAGACACTTCAGCTTATCGGCCATCGACGACTGCCATTTACTGAATCTGAAACAAAGACAGAAGTTCTTTTCAAGAGAATCCAGCTTGAAAGAAACCACGAAAGAGAAAGCTGAGACGATGTCATCAAACCCGGTGCAATTCGCTGTTGGAGAATCCGTTTCCCCTCTACGTATAGTACCACAGAATGCGCAGACGATGCTCGGCAGGTCCAGATGCTACTCCTTAGGATCCGATCTGCCGCCGGTTCAGTTGAGTGCACGAGCGAGACTGAACCTTGGACTCAAAGAAACTCCTGGAAACGGTGAAAAAACTATCGGCTTCTGTAACTGCGGAAGTCAAACCGATCTACCCTCGTTGGACGATATGTCGCCCACTAAGAAGTATCCATACAGCACACAGAGCAGTTCTTCGATATCTGATTGCAGTCAAGTGGGCCGTACCTCGCCAACTACATCAGTGACCTCGCACACGTCTGAGGATCTCGAAGCTATAGTTAAAGTGACGACCAACGAAATCGCAACACAAATAAGATCCCAGTTAAGAGGGGTCATCTCTAAGGTGGACGATATACTAGAGAATTCCGACTCTCTCGAGCAGTCCAATATGAGTATGACGTCCATATCTAGTCAGAGTGATAAGAATTCCGTATCAGTCGTGGACGTCGCCGAACTGTTGATCGGCATGTCGAGGGAAATAGCTTCGGAGGTGAAACATGAGTTCAGAGAAATGGTTAATACTGTTGACGAGATGATTTCTCCAGAACTTTCCGGATCCAGAAGAAGTTCACCGCCGCAGACCGGAAGGAGGAGGATAGGATCCGGTCCGGAACTCGGGTTGGACTTGAACGTCAGTCAAACCCAGGTGTTGAAAAAATGTCCGACCAGTCCAGTACTTCCCATCCAGTATGATGATGATCATTGCTGCAAACGCTCACCCGGCGCCCAGTCGCCCCTGTCAGCCCAGAACACTCCGAGTCATGAGGCATCAGCACCGAACAGCATCTCGTTCAACTCCAGCGAGACTTCTACACCGGACACGATCGTACAAGTGATGACGTCACAGAACTCTCCCATACTCTCCAAATCATCGAGCGCCAACAAACTATCCGACGATGAAACCTGTACGGACCCGAGATGCAGGCATTACTGTATCAAGAAGAACTGGTGCCAAAACCCGTCCATCAGTTCCCAGGACAGCGGAATAAACCTGACATTCACAGAAACCGACTCCTACATGGACTTCGACAAATGGCGAACATCCTCGGACACGTCATCGAACAAACTGAAAAAGCTCCAGGGTAGATTACGGATGTGTCAAAAGTACGAGAAGAGCGAGGTGCCGGACATTATAGAGGGTGTTCCAGTGTGTTCAGGGGATCACGCGCGGACAGCCAAGAACTGTGATCCTGATACGGCGAGGGTTGTCTTCCAGATACCTGATGATAATGAAAAGCAGAATCGCACAACGGCGCTGGATTCTAAGCGTTCCAGCCGTTCGTCGAACGCGTCGTCTAGCAGTTCGCGCTCCAGCGGCTACG
GCACCGACCACAGGACTCCAGAGGAACAATTTTATGAGCGAAGTGAATCTGATCGTATCTTCAAGCCGGATTGGGAGGCAGACAGTACGTGTAGCGAGGCGTCCCTCACAGACTTCACTTTGGACGATGAGGGGAAATGGCACTGTCCGCCCCGAGAAGTCTGGAGGGCCACTGTGGAGGCTATACACGAATACAACATGGTGAGAGCCGGTGACAAGATCCTCGTATGTCTGTCTGGTAGCCGGGAGTCGGTCGCATTGCTCCATACCATGCACCAGGAGCACAACATCCGTATCCTCCGCCCGTTCATCTACGTCCGCTCTCAGGACCTGGAACACTTCGCTCGCTCCCAGGGTCTGCCTGACTTCGGAAGAGATTTGTCTGATAAACCTGTCCTAGGTCCCAGTAAAAACAAATTGGATCGATCTATATCTCTACCATGCGGGTCTGACAAAGGAGATTCCTTGGACCGGGAGGAGGATCTATCAGGGTCCCTGCCCGAGCTGGTGGATCCGATGTCTTCGGCTCGTGAAATCCTGAAAACTCATGAGAAATTATATCCTTATTTATTCTCCAGCCTGAAGAACGCTCTGCATCCACTCATCAGCGGCAGGAACATTGATAAGGATAACAGACACAGAAAGAAATCTGTTATACAGATGAAGAACGGCTCCCCCGTGTATGACTCGGAGGAGGGGACGGAGGAGGAGCCGGTGCCATAG

Protein sequence:

>DPOGS205971-PA
MTSVLKKSRGTKGKSESMQEDLPSRWIARGLKEDAGKLIKYIDDNVIGKENSFCGPFGRRKVVYCDYSGSGRSLQWLEDYVTRDVLPRHATRCSALSVTSGQTEVYRSESKEILRTAVGAGDEDVVVIGASLTQFLRALRPENVVVFVSSRETERQLTPWKEFGAEIIKIPETKEGFIDLNNLEMRLQENTEDGKRMIGFFPAASKLTGVLADDVATTLLLHQYGAWSFWDYTLVAPTCAVDMNPTFLGVEDGMVNKDALFYNCEKFVGGIQGPYITVVKKHLFTNTAIYHDDVEVLSERISEYKCTEAVRAAIVMQIRDGIGLQNIADRQDHITKRVDRRHHYILMFYNIQNNSKIIKIPETKEGFIDLNNLEMRLQENTEDGKRMIGFFPAASKLTGVLADDVATTLLLHQYGAWSFWDYTLVAPTCAVDMNPTFLGVEDGMVNKDALFYNCEKFVGGIQGPYITVVKKHLFTNTAIYHDDVEVLSERISEYKCTEAVRAAIVMQIRDGIGLQNIADRQDHITKQVLSHIKNIPELILLGSESRTRRLPIFSLMVKHPRGIFLHHNFICAVLNDVFGIQARGGLSGDLNYGSDILGIDDHLLKEYEKLLDVEAQKEISRVRKLSVVKSPEVPIHNEPLRPGFCRLSLPFFMSENELAFVLEALKMVATEGWKILPQYVVNSETGQWRHHSCSVLRDKKSLYSIRFNDGKITANERRVSGPGIFPQTFAECLQTARNLFNRARKLAMKCATVEPEVSFNPKIDYLRWFMLPKEAHDLLLGKSANVKHIVPFDPVGYTGTRKSLNNSRSSHTSSPVLGTTSRHFSLSAIDDCHLLNLKQRQKFFSRESSLKETTKEKAETMSSNPVQFAVGESVSPLRIVPQNAQTMLGRSRCYSLGSDLPPVQLSARARLNLGLKETPGNGEKTIGFCNCGSQTDLPSLDDMSPTKKYPYSTQSSSSISDCSQVGRTSPTTSVTSHTSEDLEAIVKVTTNEIATQIRSQLRGVISKVDDILENSDSLEQSNMSMTSISSQSDKNSVSVVDVAELLIGMSREIASEVKHEFREMVNTVDEMISPELSGSRRSSPPQTGRRRIGSGPELGLDLNVSQTQVLKKCPTSPVLPIQYDDDHCCKRSPGAQSPLSAQNTPSHEASAPNSISFNSSETSTPDTIVQVMTSQNSPILSKSSSANKLSDDETCTDPRCRHYCIKKNWCQNPSISSQDSGINLTFTETDSYMDFDKWRTSSDTSSNKLKKLQGRLRMCQKYEKSEVPDIIEGVPVCSGDHARTAKNCDPDTARVVFQIPDDNEKQNRTTALDSKRSSRSSNASSSSSRSSGYGTDHRTPEEQFYERSESDRIFKPDWEADSTCSEASLTDFTLDDEGKWHCPPREVWRATVEAIHEYNMVRAGDKILVCLSGSRESVALLHTMHQEHNIRILRPFIYVRSQDLEHFARSQGLPDFGRDLSDKPVLGPSKNKLDRSISLPCGSDKGDSLDREEDLSGSLPELVDPMSSAREILKTHEKLYPYLFSSLKNALHPLISGRNIDKDNRHRKKSVIQMKNGSPVYDSEEGTEEEPVP-
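
Consistency check (illustrative sketch, not part of the database record):

The transcript and protein records above can be cross-checked with a few lines of Python. The sketch below assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon; the helper name `translate` and the variable names are hypothetical. It confirms that 4722 bp equals three times (1573 residues plus one stop codon) and that the first 30 nucleotides of DPOGS205971-TA translate to the first 10 residues of DPOGS205971-PA.

```python
# Standard genetic code, built from the conventional TCAG codon ordering
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Lengths as stated in the Transcript and Protein lines of the record.
transcript_bp = 4722
protein_aa = 1573

# A complete coding sequence has one codon per residue plus one stop codon.
length_consistent = transcript_bp == 3 * (protein_aa + 1)

# First 30 nucleotides of >DPOGS205971-TA, copied from the record.
first_30_nt = "ATGACAAGTGTTTTGAAAAAGTCAAGAGGC"
first_10_aa = translate(first_30_nt)

print(length_consistent)  # True
print(first_10_aa)        # MTSVLKKSRG
```

Translating the full nucleotide sequence the same way requires joining its lines into a single string first, since the sequence is split across several lines in this record.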