Monarch geneset OGS2.0

DPOGS203046
Transcript: DPOGS203046-TA (1347 bp)
Protein: DPOGS203046-PA (448 aa)
Genomic position: DPSCF300206 - 16132-18043
RNAseq coverage: 171x (Rank: top 50%)
Annotation
Source | Hit | E-value | % | Description
Heliconius | HMEL012895 | 0.0 | 75.62% |
Bombyx | BGIBMGA006797-TA | 4e-169 | 69.84% |
Drosophila | CG3645-PA | 3e-143 | 53.95% |
EBI UniRef50 | UniRef50_B4LU58 | 6e-142 | 55.28% | GJ17247 n=4 Tax=Opisthokonta RepID=B4LU58_DROVI
NCBI RefSeq | XP_623799.2 | 6e-160 | 59.41% | PREDICTED: similar to CG3645-PA, isoform A isoform 2 [Apis mellifera]
NCBI nr blastp | gi|350396718 | 6e-163 | 60.63% | PREDICTED: tRNA-dihydrouridine synthase 1-like [Bombus impatiens]
NCBI nr blastx | gi|350396718 | 1e-159 | 61.22% | PREDICTED: tRNA-dihydrouridine synthase 1-like [Bombus impatiens]
Group
Gene Ontology:
GO:0017150 | 5.5e-180 | tRNA dihydrouridine synthase activity
GO:0050660 | 5.5e-180 | flavin adenine dinucleotide binding
GO:0055114 | 5.5e-180 | oxidation-reduction process
GO:0008033 | 5.5e-180 | tRNA processing
GO:0008152 | 4.4e-75 | metabolic process
GO:0003824 | 4.4e-75 | catalytic activity
KEGG pathway 
InterPro domain:
[1-359] | IPR001269 | 5.5e-180 | tRNA-dihydrouridine synthase
[1-223] | IPR013785 | 4.4e-75 | Aldolase-type TIM barrel
Orthology group: MCL13674 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS203046-TA
ATGGTAGATGCCAGTGAATTGGCATGGAGATTACTCAGTCGAAGACACGGTGCTACGCTTTGTTATACCCCCATGTTACACAGTACCGTATTTATTAAAGACCCTAAATACAGAAAAGAGAATTTTACAACATGTACTGAAGACAGACCATTATTTGTTCAGTTCTGTGGTAATAATCCAGAAACAATGGCTGCCGCTGCTAAACTTGTAGAAAGTGATTGTGATGCTATAGACATTAATTTGGGTTGCCCACAATCTATTGCTAAAAGGGGAAGATATGGGTCCTTTCTTCAAGAGGAGTGGCAGTTATTAAGAGACATAGTTTCTACTATGTCAAAAACAGTGTCTGTACCAATTACATGTAAAGTGAGAGTCTTTGAAAGCATTGAAAAATCTGTGCAATATGCGTTGATGCTACAAGAGGCTGGGTGTAAGCTATTGACTGTTCATGGCAGAACTAGAGAACAAAAGGGACCTTTGACAGGTATTGCTAGTTGGGAGCATATTAAAGCCATTCGAGATGCAATTTCCATACCAATGTTTGCAAATGGAAATATACAATGCTTACAAGATGTTGAACGATGTTTGCAATATACAAAAGTTGATGGTGTAATGAGTGCAGAAGGTAATCTTACAAATCCAGCAATATTTGAAGGAATAAACTCTGTATCATGGGAGATTGCTTTAGAATATCTTGACCTTGTGGAGACATACCCCTGCCCAACATCATACATAAGAGGTCACCTATTCAAAATCTTTCATAAAGTATTCACTTTTGATAGCAATAATGAAGAAAGACAGTTGTTGGCTACAGCTCAATGCCTAGATGATTTTAAACAAGTGTGCATTAAAATAAAAAATAAGTATTTGCCCTATCATGAGGGGAGATTGCAATTTGATGATAATGAAGGAATTACAAGAAATCAAAAAAGTTTAATTCTTCCACCATGGATCTGTCAACCATATGTGAGAATGTCTCCTGATGAACATACAAAGAAAATGGAGAGGATAGTAAATAGTCAGGATAATAACAATGATTCTAAAAGAACTTTCGAAGACAATGATGGCAACAAAATATCAAGAAAAAAAATGAAAAAAATGCGAAGGGTTATGAGGCGACCGGTCAAACCTGAGGAAGCATTGAAAAATAGTAGGAGTGGTGATATATGTGTTAATGACACATGCCCCAATCCCCTTGGAGGGAAATGCGAATATCAGCTTTGCAAAAAATGTTGCAGAAATAAATGTTATGAGGAAAATAGAGATTGCAAAGGTCATAGAATATTGGTGAAAACAAGGAGAGAAATGGCTATTAGTTTTGCACAAAACGCTGACAAAATTGCTTGA

Protein sequence:

>DPOGS203046-PA
MVDASELAWRLLSRRHGATLCYTPMLHSTVFIKDPKYRKENFTTCTEDRPLFVQFCGNNPETMAAAAKLVESDCDAIDINLGCPQSIAKRGRYGSFLQEEWQLLRDIVSTMSKTVSVPITCKVRVFESIEKSVQYALMLQEAGCKLLTVHGRTREQKGPLTGIASWEHIKAIRDAISIPMFANGNIQCLQDVERCLQYTKVDGVMSAEGNLTNPAIFEGINSVSWEIALEYLDLVETYPCPTSYIRGHLFKIFHKVFTFDSNNEERQLLATAQCLDDFKQVCIKIKNKYLPYHEGRLQFDDNEGITRNQKSLILPPWICQPYVRMSPDEHTKKMERIVNSQDNNNDSKRTFEDNDGNKISRKKMKKMRRVMRRPVKPEEALKNSRSGDICVNDTCPNPLGGKCEYQLCKKCCRNKCYEENRDCKGHRILVKTRREMAISFAQNADKIA-
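
The figures in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original entry, that assumes the transcript is a complete coding sequence translated with the standard genetic code and that the trailing "-" in the protein marks the stop codon. The function names (`translate`, `cds_length_consistent`, `genomic_span`) are hypothetical helpers introduced only for illustration.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" matches this record).
    Any trailing bases that do not fill a whole codon are ignored.
    """
    seq = nucleotides.upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def cds_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True when the transcript encodes protein_aa residues plus one stop codon."""
    return transcript_bp % 3 == 0 and transcript_bp // 3 == protein_aa + 1


def genomic_span(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based genomic interval."""
    return end - start + 1


# First six and last four codons of DPOGS203046-TA, copied from the record.
print(translate("ATGGTAGATGCCAGTGAA"))   # MVDASE
print(translate("AAAATTGCTTGA"))         # KIA-
print(cds_length_consistent(1347, 448))  # True
print(genomic_span(16132, 18043))        # 1912
```

Under these assumptions the 1347 bp transcript corresponds to 448 residues plus a stop codon, and the genomic interval (1912 bases) is 565 bases longer than the transcript, which would be consistent with intronic sequence, although the record itself does not list the exon structure.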