Monarch geneset OGS2.0

DPOGS212483
Transcript        DPOGS212483-TA  3216 bp
Protein           DPOGS212483-PA  1071 aa
Genomic position  DPSCF300222 - 255543-273869
RNAseq coverage   253x (Rank: top 41%)
Annotation
Heliconius        HMEL009318        3e-156  69.01%
Bombyx            BGIBMGA009657-TA  0.0     85.61%
Drosophila        Rbcn-3B-PA        3e-147  61.46%
EBI UniRef50      UniRef50_B0W6Z8   7e-162  65.38%  WD repeat protein 7 n=1 Tax=Culex quinquefasciatus RepID=B0W6Z8_CULQU
NCBI RefSeq       XP_001844482.1    1e-162  65.38%  WD repeat protein 7 [Culex quinquefasciatus]
NCBI nr blastp    gi|170033232      2e-161  65.38%  WD repeat protein 7 [Culex quinquefasciatus]
NCBI nr blastx    gi|158297190      5e-158  61.61%  AGAP008003-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology     GO:0005515           3.3e-45  protein binding
KEGG pathway
InterPro domain   [487-816] IPR011046  3.3e-45  WD40 repeat-like-containing domain
                  [356-668] IPR015943  3.8e-27  WD40/YVTN repeat-like-containing domain
                  [19-592] IPR011047   5.7e-27  Quinonprotein alcohol dehydrogenase-like
Orthology group   MCL10625 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212483-TA
ATGCCCGGTACAAATCTTATAGTACCTGTTGTGTTATGGGGCAAACATGCTCCGACACATTGCATTTCATCAGTATATCTTTCAAGAGATCAGAAAACTCTTGTAACTGGATGCTATGACGGTCAGATATGTATTTGGCAAGTAGATCCCGATTCATTGAAAATGATCCCACGGTGTCTTCTTGTCGGACATACAGCTCCCGTCACATGTTTATCTAGAGCCTCTATAATACAGGATAATAACTTCATAGTGAGCTCCTCTGAGGCCGGTGAAATGTGTACCTGGGATTTGGTGGATGGCAAATGCAGGGAGAGTGTAAAACTGACACAAATACACACAAATATACAGATGATTTTTTTGCCAAAATCGGCGTCAACTTCCAACTGCTCTAATGCCCAGTCAGCGAACACACGGCGCTGTCTATGGTCATTGATAATGGATCCCTTCTCGTTGGAGATCCTGTTCTGTTTGAGTTCGAAGATGAATCCGGATTGGATATCAGCGTTGCATGTTCTACGGCCATCATCGAGGAAGGATGATGTTGTGCTTGCAATTTCCATAACGGGTACAGTCAAGGTGTGGTCCCTCCTCGGCCACGAGAATAAGCAATCGGAACCCATATACGAGAATGAGTCAAAGCAGATCCGCTGTCTGAACGCCCTCTCACTGAACTGCTGCGCCTACAACCAGCGCACAGTGTTGATAGTCTGCGCCAAGTACTGTCAGATATATGATGCGGGTGACTTCTCAGTGCTGTGTTCGATCCAAGCGCCATCTGGCGAGCGTTGGATGTCAGGTGACTTCCTCGCACCGGACAGAGTCCTAGTTTGGTCGACCGAAGGAAAAGGGTACCTTTATAAACTACCCGCAAACTCTGTACCAGACCACAAAGGCTTTCATGGACCAACAGTTGAACACGATAGTCCAGTGTTGTTCAGCACCTTGGCTCATCCAGATAATAAGTTGCTGTCATGCCCGCCAGCCATGCGTTTCATTCTGACCTCAGTCGGTGGAAAACAGAGAAGATTACTTCTGAGAGGAGATTCTCATGGCGTCGTCTCTTTATGGAATGTAGATGATGCAGCCACGCCACAGATTAAAAACGTTTCTTCTCATGCTCCCATCGTGATGACGAGCTTGGAAATGGCCTGGGCGGAAATGAACCCCAGTCCGGTTGGTATATTGGATCAGTTCGGACACCCAGATGAACCTGTCGGTGGAAAACAGAGAAGATTACTTCTCAGAGGAGATTCGCATGGCGTCGTCTCTTTATGGAATGTAGATGATGCAGCCACGCCACAGATTAAAAACGTTTCTTCTCATGCTCCCATCGTGATGACGAGCTTGGAAATGGCCTGGGCGGAAATGAACCCCAGTCCGGTTGGTATACTGGATCAGTTCGGACACCCAGATGAACCTGAAATAAAACTGACTTCCTCAATATACCTGCTGGCCGCTAACCGTTTGGTGGTAGGTCGCGAGGACGGGTCCATAGTGATAGTGAACGCAACCCATACAGTACAGCTGCAACTCCTGCACGGGAATCACCAGCAGTTGAACGACTGGCCCCCCCACCAACTGCTTTTGGGCCACAACGGTAGAGTGAACTGTTTGCTTTACCCACATTATGTTAATTACAGGTACGACAAGGCCCACCTAGTATCTGGGGGTATAGATTTCGCTGTGTGTCTGTGGGATCTGTTCAGTGGGGCTCTTCTCCATCGCTTCTGTGTACACGCTGGCGAGATCACACAGTTGATAGTACCACCACAGAATTGTACCCCACGAATTCAGAAGTGTATATGTTCCGTGGCATCTGACCACAGCGTGACCTTACTGAGTCTGTCTGAACGTAAGTGCGTGACGCTGGCCTCCCGTCACTTATTTCCGGTGGTGACCATCAAGTGGCGGCCGGCTGATGACTTTATGGTGGTAGGCTGTAGTGATGGTACTGTCTACGTCTGGCAAATGGAGACTGGACATCTTGATAGAGTACTGCA
TGGTATGATAGCTGAGGAGGTCCTTGGTGCGTGTGACGAAGCTGTCGCTGATGAATTAGGTGGAGCGGTCGGTGGTGGGGACCTGCAAGGCCTGGCAAATCCCGCGGTCCACTTCTTCAGGTACGACAAGGCCCACCTAGTATCTGGGGGTATAGATTTCGCTGTGTGTCTGTGGGATCTGTTCAGTGGGGCTCTTCTCCATCGCTTCTGTGTACACGCTGGCGAGATCACACAGTTGATAGTACCACCACAGAATTGTACCCCACGAATTCAGAAGTGTATATGTTCCGTGGCATCTGACCACAGCGTGACCTTACTGAGTCTGTCTGAACGTAAGTGCGTGACGCTGGCCTCCCGTCACCTATTTCCGGTGGTGACCATCAAGTGGCGGCCGGCTGATGACTTTATGGTGGTAGGCTGTAGTGATGGCACCGTCTACGTCTGGCAAATGGAGACTGGACATCTTGATAGAGTACTGCATGGTATGATAGCTGAGGAGGTCCTTGGTGCGTGTGACGAAGCTGTCGCTGATGAATTAGGTGGAGCGGTCGGTGGTGGGGACCTGCAAGGCCTGGCAAATCCCGCGGTCCACTTCTTCAGGGGTCTTCGTCATCGTAACCTGACAGCAATGAGAGCGGCCACCCAGCGAGGTCTGGCGGCGCTGCAGGTGGCTGATCGTTCCGGAAACGCCGGTCCACTGGCGGAGCCCGCGCGGGCGAGACGAGCACCCCTCACAGTACAGGGCTTCCGGTCCAACCCCGCCGATCCTGAGAGTCATATCCTGTTCTTCGACATCGAGGGTCTGATCGTGGAGCTACTGAGCGAGGAGTACTCCGCCATGAGCCCAGCGAGTTTGGAAGCTGCCGGTCTCATAACCAGTGAATACATGAAGGTGGCAGCCCTGACGCAGAGTGCCAGCCCTGACGCAGCCAAGCGGATATCCGACTTCTTTGGGAGAGTGAAAGACAAGGCCGGAGATATGGAAAGACTGTTGAAGGAAAAGGACAAACATGGTATTCTCGCAAAAATGAAGGAAGGGGCGGAAACTATGCAGACTAAGTTACAAGCGAAAGCGGAGCAATTGAAGGGACAGATGGAGCATAAAGAGTATGCCTACGCCCTAGCCAAGGCAGTCGAGATTCAAGTAAACAAGCCCCTCTTGAACACGAATGAAGTAGACTTTAAGTATCAGACAATAAAGATCATATTCGAGTGA

Protein sequence:

>DPOGS212483-PA
MPGTNLIVPVVLWGKHAPTHCISSVYLSRDQKTLVTGCYDGQICIWQVDPDSLKMIPRCLLVGHTAPVTCLSRASIIQDNNFIVSSSEAGEMCTWDLVDGKCRESVKLTQIHTNIQMIFLPKSASTSNCSNAQSANTRRCLWSLIMDPFSLEILFCLSSKMNPDWISALHVLRPSSRKDDVVLAISITGTVKVWSLLGHENKQSEPIYENESKQIRCLNALSLNCCAYNQRTVLIVCAKYCQIYDAGDFSVLCSIQAPSGERWMSGDFLAPDRVLVWSTEGKGYLYKLPANSVPDHKGFHGPTVEHDSPVLFSTLAHPDNKLLSCPPAMRFILTSVGGKQRRLLLRGDSHGVVSLWNVDDAATPQIKNVSSHAPIVMTSLEMAWAEMNPSPVGILDQFGHPDEPVGGKQRRLLLRGDSHGVVSLWNVDDAATPQIKNVSSHAPIVMTSLEMAWAEMNPSPVGILDQFGHPDEPEIKLTSSIYLLAANRLVVGREDGSIVIVNATHTVQLQLLHGNHQQLNDWPPHQLLLGHNGRVNCLLYPHYVNYRYDKAHLVSGGIDFAVCLWDLFSGALLHRFCVHAGEITQLIVPPQNCTPRIQKCICSVASDHSVTLLSLSERKCVTLASRHLFPVVTIKWRPADDFMVVGCSDGTVYVWQMETGHLDRVLHGMIAEEVLGACDEAVADELGGAVGGGDLQGLANPAVHFFRYDKAHLVSGGIDFAVCLWDLFSGALLHRFCVHAGEITQLIVPPQNCTPRIQKCICSVASDHSVTLLSLSERKCVTLASRHLFPVVTIKWRPADDFMVVGCSDGTVYVWQMETGHLDRVLHGMIAEEVLGACDEAVADELGGAVGGGDLQGLANPAVHFFRGLRHRNLTAMRAATQRGLAALQVADRSGNAGPLAEPARARRAPLTVQGFRSNPADPESHILFFDIEGLIVELLSEEYSAMSPASLEAAGLITSEYMKVAALTQSASPDAAKRISDFFGRVKDKAGDMERLLKEKDKHGILAKMKEGAETMQTKLQAKAEQLKGQMEHKEYAYALAKAVEIQVNKPLLNTNEVDFKYQTIKIIFE-
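
Consistency check (illustrative sketch, not part of the original record): the transcript length of 3216 bp corresponds to 1072 codons, that is, the 1071 residues of DPOGS212483-PA plus one stop codon, which the protein record writes as a trailing "-". The Python below assumes the standard nuclear genetic code. The names CODON_TABLE and translate are hypothetical helpers introduced only for this example. It translates just the first and last few codons of DPOGS212483-TA to show how the two sequences relate.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: the transcript uses the
# standard nuclear code). Stop codons are written "-" to match the record.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO.replace("*", "-"))
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 nt and last 18 nt of DPOGS212483-TA, copied from the record.
first_codons = "ATGCCCGGTACAAATCTTATAGTACCTGTT"
last_codons = "AAGATCATATTCGAGTGA"

print(translate(first_codons))  # MPGTNLIVPV
print(translate(last_codons))   # KIIFE-
print(3216 // 3 == 1071 + 1)    # True
```

Both translated fragments match the start (MPGTNLIVPV) and end (KIIFE-) of the protein sequence listed above.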