Monarch geneset OGS2.0

DPOGS214456
TranscriptDPOGS214456-TA4974 bp
ProteinDPOGS214456-PA1657 aa
Genomic positionDPSCF300441 + 18594-27614
RNAseq coverage719x (Rank: top 18%)
Annotation
HeliconiusHMEL0044350.079.18% 
BombyxBGIBMGA009575-TA4e-16664.95% 
DrosophilaBRWD3-PA4e-14140.21% 
EBI UniRef50UniRef50_E0VV610.042.37%WD repeat domain-containing protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VV61_PEDHC
NCBI RefSeqXP_395863.30.043.25%PREDICTED: similar to CG31132-PA [Apis mellifera]
NCBI nr blastpgi|3838624270.043.90%PREDICTED: PH-interacting protein-like [Megachile rotundata]
NCBI nr blastxgi|3227923130.038.57%hypothetical protein SINV_04817 [Solenopsis invicta]
Group
Gene OntologyGO:00055151.2e-56protein binding
KEGG pathway 
InterPro domain[121-447] IPR0159431.2e-56WD40/YVTN repeat-like-containing domain
[121-455] IPR0110462.3e-53WD40 repeat-like-containing domain
[1134-1273] IPR0014874.4e-31Bromodomain
[178-217] IPR0197817.6e-07WD40 repeat, subgroup
Orthology groupMCL10768 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214456-TA
ATGGAGGATTCCACCGAACCGGTCAATGTCAGTCCTGAACTCTACTTTTTGATAGCGAAATTTCTCTCCGGTGGACCTCTGAAAGAGACTGCAAAGACTTTGTTGAAAGAGCTGGCGAGCGCGGAGGTGCTACCGCAACGTATCGACTGGGAGGGCAACGTTCATTCACAGTCGTACGATGAACTGGATTCACAATATTCCGAGATATCATGGAAGCACTTGGTATTCGTATGCGAACGAGCCCTGCAACTGGCTCGCTACTCGACGGCACCCGCTATCGCTTTGCCAGGACCAGAGAACACGGCACAGATCGCACTACAGAACGGATGCGTCGTCTGGTGTGCCGCGAGCTGGGCGGCGGCGGCGGACTTGCTGGTCAAGGTGTGGAGCGCGCTGGACGGGCGACTGTGCGCCACGCTGAGGGGGGCCGGCGCAGAGATCACCGACGTGAGCGCCTCGTGGGACGGCGCTCTCCTGGCGTGCGGCTCGGTGGAGCGTCTGGTACGCGTGTGGTGCCTGGCGACCGGCGCGCCACGAGCCGTGCTGCAAGCCCATGCCGGAACCATCACGTCGGTGGCCTGGTCGCCGCCTGCTCACAGGGAGCTGCGCTGGCTGGCCTCCACAAGCACGGATGGGTCCGTCGCCTTCTGGACGTGCTCCTTGGACGGACACTTCCTGTCTCAACCCGTCCAGTACGTGGAGCGCCTCCGGCCGGGCGCCTGTCACATGATCTGCGCCGCTTGGTCTGCGGGTGGCGCCTTCCTGGCGGCGGGCTCGGCCGACCAGCGTGTCCGCGTGTACGCGCTCGAGGCCGCCGGGCCGCGCCGTGTGCTCGAAATGGCCGCCCACACGGACGCCGTGGACTCTCTCGCTTGGGCTCACCGTGGCCTACGCTTCGTGTCCGGCAGCAAGGACGGGACGGCGGCTCTCTGGACTCTACACGCCACACAGTGGAGGCATACCTTATTGCGAGCTCAGCAACCCGAGGAGCCGCGGAAGTTGAAAGTGACGATGGTGTGTTGGGACGCGAGCGACGAGTTCGTGGTGACCGCCGTGTCGGATCGGACGCTCCGTGTGTGGTGCTCGCGGCGCGGGGAACAACTTCGCTCGCTGGCGGGACACCGGGACGAGGCCTACGTGCTGGAAGCTCACCCGCGCCTGCCGGGGGTGCTGCTCTCGGCCGGACACGACGGACAACTGTTCGTGTGGCACGTGCTGGAAGAGGAGCCGCTGGCGCGGTTCCACAACGAGATCGAGGGTCAGGGCGAGGGCGCCATCTTCGACGCCAAGTGGGGCGGCGGTGGCGCGACCGTCGCCGCTGCGGACTCTCATGGACACTTGGTGCTGCTGGGTCTGGGCGCCGGGCACCGCCTGCTGTCGGAGCTACCTCGCGAGTTGTTCTTTCACACGGATTACCGGCCCCTGGCGCGAGACGCTCTCGGCGGCGCTCTCGATGAACAAACAGAGCTGCCTCCACACCTGATGCCGCCGCCTTTTCTGGTGGACGTGGAGGGAGCGCCCCACGCGCCCCGCCATCAGCGGCTCGTACCGGGTCGTGAAGCGCTACCGCTGGAGCAACTGGTGCCCCGCACGAGACTGGACGCTATGATCGAAGCGCTGGCCGCCGACCCTCTGTCGGGAGGCGCGGGAGGGGCCGAGGGCGAGGCTGCGGGAGGCCATGGAGTGTGGCGAGGGGAAGGCGTGCGCCACACGGCGGGCTCGTGGCAGGGTCTCGACGTGCCGCTCAGCACTAGGCCTATCGTCCCTCCCCTCGCGCCCTCCGCCAGGGACGCCCTCGAGAAAGCCAGTGTCGAGTTGAACGCGTACGAGATGTCGTGGTACAGGCGGGAGATGCGGCGCCGCCCCGTCATGATCAGTACGGCGGGCGAGGGTCCCCGCCGCCGGCCAGGCCGCCGCCCGCGCCTGCCGCAGCCTCCCCGCACCCGTCCGCCACCGCCCAGGGCTCAAGTTATACCGATCCGCTTAGACGACGACCCGCCTGAGGTAGCAGACGACAGCGCGGACACGTCCGACGAGTCCGTCCGCCTGTCCGGCTCCCAGTCGTCCCGCTCGTCTCGGTCCCGCTCGTCCCGTACATCCCGCTCCTCCCGCTCCTCACGCACGAGACACTCCAGCGCCGAGCTGTGCGACTCGGACAGCTCACATTCTCCCAGCAAGACGGACAGCTCTTCCAGCTCGCAGTACTCGGACTGGGAGGGAGGTTCGGCGCTGTCTCCTCCGCAGCGCGCCCGGAGACGCCCGGCTCGCCGGCCCCCCGCCGCCGCCAAGAGAGTTCCTGCGCCCTGCGCCTCCGCCCCCGACCTCCCGGAGGCCTACCGCGTCGGCGAATGGCTGACCGCCGTGTCCCCCAGGAAGGCGCCCTACCACCCGCAGATGGGAGACAGATGTTTGTACTTCCGACTAGGTCACCAGCGGTACTTCGAGGCGGTCGCGGAGAAGGATCTGTACAAAATAAACAGCAGGGACAAGCCCTGGGAGAGAATGCAGATATATGAGTGCGAGGCGGTGCAGGTGGTCGGCATCAAGTACGTGATCCGTCCTCCGCGGGTCGCCAGTCTCAAGCTGGTGCGCGAGAGAGACGCGCGCTCCTTCACCGTCCGCTACCACGACATGCCGGACGTCATTGACTTTCTGGTACTAAGGCACCACTACGATGCGGCGGTCGCCAGGCGGTGGGGCGCCGGGGACAGGTTCCGTTGTATGATCGACGACTCATGGTGGACCGGACAGGTCGTGGAGAGGATGACGGCTGACGAGGGCGGCGCCGCGGACAGAGGTAGCTCCGACGACAGGGGGGGCGCCTGGGCCAAGGAGGCTGCCGGTCACTTCCTGTCGCTGAGAGTGAGGTGGGACAACGGGGAGGTGGAGAGACTGTCACCCTGGGACCTGGAGCCCATAGACCCGGAGAGACTGCCGTCGGAGCCTGGGGGCGCGGTGCCGGTGCTGCCCCGCGAGCTGGAGGCCGTGTTGCTGAACGAGTGGCCTCCACAGGCCTGTAGGACCATAGCACAGCATATCTCACAGGTGATGTCTCTGTCCGTGGCGGAACCGTTCGTGGCGCCGGTGGACCTGCAGCTGTATCCCTCGTACGCCCTCGTGGTGCCCTACCCCGTGGACCTGGCCACCATCCGGGCGCGCTTCGAGAACCTCTTCTATCGTCGCGCGGCCGCCGCTCAGTTCGACGCTCGCTGGCTGGCGACCAACGCCGAGCGCTTCAACGAGCCTCGCTCGCCCATAGTGAGGCAGGCTAGACTCGTCACCGACCTACTGCTCAATATTATACGGTCGTGGGAGCAGGTGGACGTGGTGGCGCAGTACCACGAGCTGGCCGCCTCCTACCACTCGGACGACGAAGACGACGACGTAGCTCATAAGAAGCGTCCGACACCCCGCGGCGTGGTGCCTCACACGGGTCATCAGCGGGCGGGTCCCCCGAGGGCAGCGCCCCCCCGGGATTGGCGCTCCGCCTGCTCCGCTCTACTGCAAGAGCTCACCGCCAGCGCTGACGCGGAGCCCTTCAGACACCCCGTATCGTCCGCGCAAGCCCCCGACTACCGCAGTGTCGTGACTCACCCCATGGACCTGGGCACGGTGAGCCGGCGCCTGTCTCAGGGTCACTACTCGCGCTCCGACCAGCTCGCCGCCGACGTCAGGCTGGTGTTCGCCAACAGTCGCCTCTACAACACGAACAAACGCAGCCGGATATATTCTATGACCGTCCGGTTGTCATCGTTGTTTGAGTCGCTGTGGTCGAGACTCCCGGTCGAGATACGGAGCACCAGGGAGGGGAGGAACAGGGGCAGAGACAGAGATAGAGGCCGGAGGACCAGGGCGAGGAGGTGTCGGCAGGGGGGAGGGGGGGAACCACTTGATGCAGAGCCGCGGCCCGGGCCCGCAGTCGTCCGTCCACCCTCGTCCGCGAGCTCGGAGTGCGAGCCGCTGGCGGCCACGTCTCGGAGGGTTCACGCCTTGAGAGAAACGAGCCACACCTCGAGCCACGGCTCTAGCGGCCCGGGCTCGAGCCGACACGACAACGGCTCGGGCTCGAGCACACGAGTCGAGCGCCCGCCGCCCGCCAGGGCCGACTCCTGGGACAGCGACGTACCGCTCACCTCGCACAGGAAAGGGAAAGGAGTCGGGAAGAAGAGCAAGAACACCTCCACCGCCTCCAGCAGCCAGGCGCCGCCGCAGCTGGCGGAGGAGTGGACGAGCGCGCCGGAGAGCGAGTCACCGACGCACATAGAGGTGGTGGAGGAGGAGCTGGTCGAGGACGAGGTGGAGTTGGAGTACGACGAGAACACGCGCACCAGCGAGGGCCCCAGGGGACCCGCGGGGACGCGTGAGTCCGTCAAGCGGCGGCCGCGCAGCTCGGGCTCCAGCGACGACAGACCGAACAAACGAAGAGTCCGCGAGCCGTACTCGTCGGGCTCGTGCTCCAGCTCAAGTTCGGAGTCGGGCTCGGGCTCCGCCTCGGGCTCCGGCACCGGCTCGGGCTGGCGCAGCGAACACTCCGCCAGACGAGTGAGATACGAGTCGGACCGCTCATACAGACCCAACAACAACTACTCCACCGACGACGACGCGCCTCTACTGCACTACAGACAGCGGCAGGGCTCCGGTCCGGGCTCGCGAGGAGGCTCGCTGTCCGGTTCACGGTCGGGCTCGACGTCGCGGAGGCCTCTCCGACGGCGCGGCTCCCCGCGGCGGTACAACGAAGATAGCGAAGACGACTCCGTGGCCGCCATCAGCAAGAGACTGCCGTCACATCGCCACACCTACAACCACAACCACAACAACCACGTGGCGACCACCAGCAACGACCACGACTACTACAACGGCCACGGGCCTCCGCGGTCCGGCTCGGGAGGCTCCGGGGCCGGGCCCGTGTCCATCTCCTCCCGTGGCCGCGTCCGCCGACTGACGGCGAAGGCCCGCGGTCTGCTGAGACCTTGA

Protein sequence:

>DPOGS214456-PA
MEDSTEPVNVSPELYFLIAKFLSGGPLKETAKTLLKELASAEVLPQRIDWEGNVHSQSYDELDSQYSEISWKHLVFVCERALQLARYSTAPAIALPGPENTAQIALQNGCVVWCAASWAAAADLLVKVWSALDGRLCATLRGAGAEITDVSASWDGALLACGSVERLVRVWCLATGAPRAVLQAHAGTITSVAWSPPAHRELRWLASTSTDGSVAFWTCSLDGHFLSQPVQYVERLRPGACHMICAAWSAGGAFLAAGSADQRVRVYALEAAGPRRVLEMAAHTDAVDSLAWAHRGLRFVSGSKDGTAALWTLHATQWRHTLLRAQQPEEPRKLKVTMVCWDASDEFVVTAVSDRTLRVWCSRRGEQLRSLAGHRDEAYVLEAHPRLPGVLLSAGHDGQLFVWHVLEEEPLARFHNEIEGQGEGAIFDAKWGGGGATVAAADSHGHLVLLGLGAGHRLLSELPRELFFHTDYRPLARDALGGALDEQTELPPHLMPPPFLVDVEGAPHAPRHQRLVPGREALPLEQLVPRTRLDAMIEALAADPLSGGAGGAEGEAAGGHGVWRGEGVRHTAGSWQGLDVPLSTRPIVPPLAPSARDALEKASVELNAYEMSWYRREMRRRPVMISTAGEGPRRRPGRRPRLPQPPRTRPPPPRAQVIPIRLDDDPPEVADDSADTSDESVRLSGSQSSRSSRSRSSRTSRSSRSSRTRHSSAELCDSDSSHSPSKTDSSSSSQYSDWEGGSALSPPQRARRRPARRPPAAAKRVPAPCASAPDLPEAYRVGEWLTAVSPRKAPYHPQMGDRCLYFRLGHQRYFEAVAEKDLYKINSRDKPWERMQIYECEAVQVVGIKYVIRPPRVASLKLVRERDARSFTVRYHDMPDVIDFLVLRHHYDAAVARRWGAGDRFRCMIDDSWWTGQVVERMTADEGGAADRGSSDDRGGAWAKEAAGHFLSLRVRWDNGEVERLSPWDLEPIDPERLPSEPGGAVPVLPRELEAVLLNEWPPQACRTIAQHISQVMSLSVAEPFVAPVDLQLYPSYALVVPYPVDLATIRARFENLFYRRAAAAQFDARWLATNAERFNEPRSPIVRQARLVTDLLLNIIRSWEQVDVVAQYHELAASYHSDDEDDDVAHKKRPTPRGVVPHTGHQRAGPPRAAPPRDWRSACSALLQELTASADAEPFRHPVSSAQAPDYRSVVTHPMDLGTVSRRLSQGHYSRSDQLAADVRLVFANSRLYNTNKRSRIYSMTVRLSSLFESLWSRLPVEIRSTREGRNRGRDRDRGRRTRARRCRQGGGGEPLDAEPRPGPAVVRPPSSASSECEPLAATSRRVHALRETSHTSSHGSSGPGSSRHDNGSGSSTRVERPPPARADSWDSDVPLTSHRKGKGVGKKSKNTSTASSSQAPPQLAEEWTSAPESESPTHIEVVEEELVEDEVELEYDENTRTSEGPRGPAGTRESVKRRPRSSGSSDDRPNKRRVREPYSSGSCSSSSSESGSGSASGSGTGSGWRSEHSARRVRYESDRSYRPNNNYSTDDDAPLLHYRQRQGSGPGSRGGSLSGSRSGSTSRRPLRRRGSPRRYNEDSEDDSVAAISKRLPSHRHTYNHNHNNHVATTSNDHDYYNGHGPPRSGSGGSGAGPVSISSRGRVRRLTAKARGLLRP-