Monarch geneset OGS2.0

DPOGS207476
TranscriptDPOGS207476-TA1518 bp
ProteinDPOGS207476-PA505 aa
Genomic positionDPSCF300051 + 271516-275409
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0052323e-12263.78% 
BombyxBGIBMGA000949-TA5e-7458.06% 
Drosophila% 
EBI UniRef50UniRef50_D2A1921e-6632.36%Putative uncharacterized protein GLEAN_07126 n=2 Tax=Tribolium castaneum RepID=D2A192_TRICA
NCBI RefSeqXP_975141.12e-6732.36%PREDICTED: similar to Bardet-Biedl syndrome 7 [Tribolium castaneum]
NCBI nr blastpgi|910815634e-6632.36%PREDICTED: similar to Bardet-Biedl syndrome 7 [Tribolium castaneum]
NCBI nr blastxgi|1583007936e-6532.62%AGAP011899-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055156.9e-13protein binding
KEGG pathway 
InterPro domain[32-313] IPR0110466.9e-13WD40 repeat-like-containing domain
[163-316] IPR0159432.4e-11WD40/YVTN repeat-like-containing domain
Orthology groupMCL16186 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207476-TA
ATGAGTTACGATTTATCACGAGTTGATTACACTTTATGTGGAATAACTTATCCTGATACACTTAAAATATTGCCAGCATCAGATCAAAAGCTCAAACAAAGGTTTGTAGTTGGCGATAAAAATGGAGTGCTTCAATGTTTAAGCATAAAGGATGAAGAACCGGTAGTGAACTTTAAAACACTTCCTGGAAAACCAATTACATGTGTACAATTAGGCTCACCTGCGGGAAACACATCCGACAAAATATTTACGGCTTCCGGAAATGAGGTTAAAGGTTATAATAAGAAGGGCAAGGTGTTTTTCGCGATTGAAACTTCAGTATCAGAAACCATCACTTCTATGTCCGTTATTGGAAATGACTTGATATTGTGTAGCGGAAGAACTATTACATTTTATAGGGACCTACAAGAATTATACACATACATTTGTGATGATAGAGTACTTGACTCAACACCATTCAGCACTCCCAATAGCGTCCGTGTACGTCTGCTGGTTATAATTGCAAATAAAGAAGCATTAATAGTTGAGAACGGCAAGCTTCTGCAAAAACTCTATGTATCTGCCGGACCTACCAGCATAACTGTTCCACCATTTATAGGTCATGTGGATGTGTCTGCGATTTATGGCTCCGCTGATGGTTCGATCGGTGTCCTAAGTTACCAGGAGTCAGAATTAAACAGTAAATGTTTGGTCCCGGGAGCTGGGTTGGGTTCTGTCACATGTCTTGGTTGGCTCAATAACAGCTCTGGCTATCACCTGGCTGTCGGACGTCACGATGGTTCAATACAACTTCACTTGCTCAATAGTGAAAATTTAAATGACAAGCCAAGGCTGAAGTTCACTTATTTTTCAGGGGAACCTGTTACGTCTGTTTGTGGTGGTGTTATTAGTACGGATGAACCTGAGTTAATAGCCGCCACTTTCTCTGGAAGGATTTTCGGTTTGAGATCTAATCGATTCATGCCAGGAAACATATCGAGTGCCTCGCTGGATGTTTTAGCTACCCGACGCTCAAAACTTGAAGCTGAAGTTGCTAAATTGGAAAAACAGGCGATCAGTGAACGGGAAAAATATCAAAGAAATACACGATCTTTTCTTTCTGGTGTATCAGTTCCACCGCTTTTGGAAATCGAGTATGAGCTGACTGGCGCTACTCACAACAATTGGCAGGAAGTAAAACTCATATCAGCGGTTCCCTTGGACGTTCTCTTCGTGTATTGTGAAAATAAACTTGAAATACAGACGGATAGCACTGCCGTACTTAGCATTTGTACTCAACAGGAAAACAACAGCCCTGAACTTCTAGCGACAATTCGTTGCCAAGCTGGCACAAGAAGATTATGGATGCGCATACGAATTTTAGACGATAAAGAAACAAAAATGGAAGGCACAAGAGTACTTATATATGTGTTGCCTACAGGAGCTCCGAGAGTGGCTCGACTTATAAAACTATATTTGTCGTTATTGCCACACTATTCTAAATATGAATCACCTGACATAGAAAACAAGCAAAGGTAG

Protein sequence:

>DPOGS207476-PA
MSYDLSRVDYTLCGITYPDTLKILPASDQKLKQRFVVGDKNGVLQCLSIKDEEPVVNFKTLPGKPITCVQLGSPAGNTSDKIFTASGNEVKGYNKKGKVFFAIETSVSETITSMSVIGNDLILCSGRTITFYRDLQELYTYICDDRVLDSTPFSTPNSVRVRLLVIIANKEALIVENGKLLQKLYVSAGPTSITVPPFIGHVDVSAIYGSADGSIGVLSYQESELNSKCLVPGAGLGSVTCLGWLNNSSGYHLAVGRHDGSIQLHLLNSENLNDKPRLKFTYFSGEPVTSVCGGVISTDEPELIAATFSGRIFGLRSNRFMPGNISSASLDVLATRRSKLEAEVAKLEKQAISEREKYQRNTRSFLSGVSVPPLLEIEYELTGATHNNWQEVKLISAVPLDVLFVYCENKLEIQTDSTAVLSICTQQENNSPELLATIRCQAGTRRLWMRIRILDDKETKMEGTRVLIYVLPTGAPRVARLIKLYLSLLPHYSKYESPDIENKQR-