Monarch geneset OGS2.0

DPOGS214952
TranscriptDPOGS214952-TA1854 bp
ProteinDPOGS214952-PA617 aa
Genomic positionDPSCF300280 + 98124-104679
RNAseq coverage410x (Rank: top 30%)
Annotation
HeliconiusHMEL0155908e-13279.86% 
BombyxBGIBMGA004849-TA4e-15268.03% 
DrosophilaCG1523-PA1e-9844.02% 
EBI UniRef50UniRef50_Q7Q0S94e-11939.84%AGAP010182-PA n=5 Tax=Culicidae RepID=Q7Q0S9_ANOGA
NCBI RefSeqXP_001664094.15e-14045.08%hypothetical protein AaeL_AAEL013887 [Aedes aegypti]
NCBI nr blastpgi|1571379589e-13945.08%hypothetical protein AaeL_AAEL013887 [Aedes aegypti]
NCBI nr blastxgi|1955034859e-11939.51%GE10493 [Drosophila yakuba]
Group
Gene OntologyGO:00055153.3e-38protein binding
KEGG pathway 
InterPro domain[520-608] IPR0159433.3e-38WD40/YVTN repeat-like-containing domain
[64-609] IPR0110461e-35WD40 repeat-like-containing domain
[102-134] IPR0197813.7e-07WD40 repeat, subgroup
[96-134] IPR0016809.6e-06WD40 repeat
Orthology groupMCL11252 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214952-TA
ATGGGGACAAATAGCATTGACGATAAGTACTTTATGGCGAGGAATTCCGCATATAATCCGTACTGGCTAACTCGACGAGAAATCGGGTTCAATAGGCCTGTGGGTTCTGGTAATGCGCTGGCTCGTAGCCTTTATTGTGGTATGAAACCCATAGCTTCTTGGGATTGTGAACAAGCTAATTCTCTTCCCACCGGTGGCGTTTTCAACCTTGAATTTTCCCCGGAAGGATCTCTTTTAGTGGCAGCTTGTGAAAAGAAATCAATACAAATATTTGATCCACTAACACACCAACGAATATATTCTGTTATTGGAGCCCACTCAGACTGTGTTAATTGTGTTAAATTTCTTGATGGCAGAATGTTTGCGACTTGCTCTGATGATACCACAATCGCTTTATGGGATGTAAGAAATTTAAAGAAGAAAATACGTTCTCTATTGGGTCATTCGAATTGGGTTAAGAACATAGAATTTTCTGTGAAAGATAAACTACTAGTCACATCAGGTTTGGATGGGAGTATATATACGTGGGACATAAACTCATATACCGAATTCAATCTGGTTTACCAAAGAGTATTCCATGCCTCTGGTCTCATGAGGTGTCGCCTGTCACCAGATGCGAAGCAAATGGTGATGTGCACTACTGGAGGGCTCTTAGTTATAATACATGATTTAAATTTGACAACTTTAGCACAAGATCTGCACGGCTTTAAGGTTAATATTTTTTTTATTGACTCTCAAATTATTTTCTCTATGGATTTTATAATACCAATAGCCGCTATGTATGATCACTTGTTTGATCTTGAACGCAAGGAGAATAGAGTCGAATTTGTGTCTGATTTTCCTGATGGCAATGATGCTGAGGTAGTCAGTGCATTGCAGATTCATCCTCAAGGTTGGTGTGCATTAAGCCGTAATATCAGTCACGATGATAGGTCAGAGTGGTCCTGCATACACGACATCCAGCCCGGTGAGAGTATGACAAATAAGAGTTGTAGAGACACGTCCCCGTCACCACCAGCGCCGCCCCGTAGGCCTGCGCCCCGGAGGCCCAAGAGGGTCCGCAATCGGCCCTACAGGCACCTGCGACCCATGACGAGCATCACAACCCAGGGCGACACACCCCCTTCGTCGTCCAGGCCTCGTGAGTCCGAGGACTCGGAGGCCGGGCCCAGTCGAGAGCCAGCGCCCGCGAGACTCCCTCCACCGAATCTATCGAGTATACAAAATGACGTGTGGGAGGCGTCTATAACGATCAAACAACACCGAATACTGCAAGAAATGTATAGCAGGGGTCAGGTCGGGCGTAATTTCAACATGCAGCGCATCATGGGCATCAACACTGGCATCACCCCCCCGGGCGCGGTCCGCCGGCCCGCTCGTCCTGCGAGACTCCTGCCGCGGCTGCACGCATCACCTGTCCCCCTCGCACCCACACCCGACAGTGTCCTCCCCTCCACGTCCGGTCAGACGCCACTCCGGGACTTGTATGATGACGAATACAAACACTTTATAAGACAGAACAGAGACAGGTTACTATACTACATAGAGGAAACGAATGAAGGCAAAGGCTTCATAAAGGAGCTGTGTTTTTCTGCTGATGGTCGCCTGGTTTGTTCGCCTTTCGGCCGAGGCATGCGTCTGTTAGCTCTCAATGAACAGTGCGCGGAGTTATCCCACTGTGTGCAGGACTTCAAGGGTCCGTCTCGTATGGTAGACGTGGGTCAGAGCCTCGGCTTCCATCACGACCTCGTGGTCAGCTCCAAGTTCAGCCCCCGCCACCACTCCCTGGTCACCGGATGTCTCGAAGGAAAGATCGTCTGGTATGAGCCCTACAGCGGCGAGTCCTGCTACTAG

Protein sequence:

>DPOGS214952-PA
MGTNSIDDKYFMARNSAYNPYWLTRREIGFNRPVGSGNALARSLYCGMKPIASWDCEQANSLPTGGVFNLEFSPEGSLLVAACEKKSIQIFDPLTHQRIYSVIGAHSDCVNCVKFLDGRMFATCSDDTTIALWDVRNLKKKIRSLLGHSNWVKNIEFSVKDKLLVTSGLDGSIYTWDINSYTEFNLVYQRVFHASGLMRCRLSPDAKQMVMCTTGGLLVIIHDLNLTTLAQDLHGFKVNIFFIDSQIIFSMDFIIPIAAMYDHLFDLERKENRVEFVSDFPDGNDAEVVSALQIHPQGWCALSRNISHDDRSEWSCIHDIQPGESMTNKSCRDTSPSPPAPPRRPAPRRPKRVRNRPYRHLRPMTSITTQGDTPPSSSRPRESEDSEAGPSREPAPARLPPPNLSSIQNDVWEASITIKQHRILQEMYSRGQVGRNFNMQRIMGINTGITPPGAVRRPARPARLLPRLHASPVPLAPTPDSVLPSTSGQTPLRDLYDDEYKHFIRQNRDRLLYYIEETNEGKGFIKELCFSADGRLVCSPFGRGMRLLALNEQCAELSHCVQDFKGPSRMVDVGQSLGFHHDLVVSSKFSPRHHSLVTGCLEGKIVWYEPYSGESCY-