Monarch geneset OGS2.0

DPOGS201832
TranscriptDPOGS201832-TA1701 bp
ProteinDPOGS201832-PA566 aa
Genomic positionDPSCF300191 - 925612-928747
RNAseq coverage31x (Rank: top 75%)
Annotation
HeliconiusHMEL0120310.081.75% 
BombyxBGIBMGA006086-TA0.071.53% 
DrosophilaCG6053-PB2e-14144.33% 
EBI UniRef50UniRef50_D6WDC82e-15947.64%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WDC8_TRICA
NCBI RefSeqXP_971718.13e-16047.64%PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
NCBI nr blastpgi|910929246e-15947.64%PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
NCBI nr blastxgi|910929246e-15647.57%PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055154.9e-26protein binding
KEGG pathwaytca:6603898e-160 
 K11143 (DNAI2)maps-> Huntington's disease
InterPro domain[162-487] IPR0110464.9e-26WD40 repeat-like-containing domain
[197-488] IPR0159431.5e-24WD40/YVTN repeat-like-containing domain
Orthology groupMCL10641 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201832-TA
ATGGAAATTTCATACCAATACACTAAGAAACGTTGTGATTTCGGACGCCAGGCACTGTTCTGTGAACAAGGCCCGGAGTTGTGCGACAGCATCACTACCAATGTTGCTGAACATAAGCACTACATCCTCCGTAACCCTGTACATGAGCCCATTCAGAATACGCCTTGTTTTTCCCAGCAGTTCATAAACACGATCCGTGCTGAATATACAACGACCGGAGCAAACCACGCGGAGGGAGGTTGGCCTAAGGACGTAAATGTCGTTGACCCCGAAGCAACTCAACGCTATAGAAGAAAAATTGAGAAAGATGACACATATATACATTGTGTTATGTCGCTGTCCCCGGGACTGGAACACTATGTCCTCCAAAACAACGCCATTGACATGTATCGCACGTATTACGCGGAGATGGCGTCTTTGCCCCCCATAGAGAAAAGTAGTTGTCGTACCGTGAACGTGTACCGTGATGTTGCGGCTGGTGGAGGCAGACCCATCAGCTCTATTTGCTGGCAGTCTGAAGGGGGTCATTGCTTTGCCGTCACTTACGTTGATGTCGATTTTAATCATATTGCTCGGGCACCAGTGGAGTCGTATATATGGGATATAGAAAACGCTAACTATCCGATATCCACACTGCTTCCGCCATCACCTCTGTTGGATTTACAATTTAATCCACGAGATCAAAATACGTTGATCGGAGGAATAATGAATGGACAAGTAGGTGTATGGGACAAACGGCAACATGGAGCCGCAGCTGGTTTGTGTGCTCCGCATGTTGCACATCGAGAACTAGTGCGGAATGTACTTTATATCAATTCAAAATCGGGTCAAGAGTTTTTTTCTGGCGGACCAGATGGTGCATGTAAATGGTGGGACATTCGTAATTTGAATGAACCGACAGACGAAATGATACTAGATGTTGTGAAGTCATCGTTTGATGTCCAAACGATGGCGAACGCAAACGGCGTCAGTACGCTGGAGTACGAGTCGACGATGCCGACTCGTTTTATGGCTGGAACTGAAAATGGGTTTGTTATTGGAGGAAACCGTAAAGGAAAAACACCGATGGAGAAATTGCCTGCTAAGTTCGAAGCTCACTTGGGTCCTGTTTGGTCATTGGAACGTAATCCGGGCTTTCTTAAGAATTTTCTCACCGTCGGAGACTGGACCGTTCGTCTGTGGAGTGAGGACTGTCGTGAATCAGCCGTGCTCTGGTCTCCTCCCCACCGGCATAAAGTTACAGCAGCCGCTTGGAGTCCCACACGTCTGTCACTTATGACTATGATGCAATGGAATGGTGTCATGGCTATATGGGACTTACTCAGGCGGCAACATGAGCCGGTGCTTACAATGCAAATTTGTGAAGAGCCGCTTCTAAGAGTGCGTATGCATGACGGTGGGACTTTGGCCGCTTGTGGCAGTAAAAAGGGAAATGTGTACATGGTGGAGCTATCACAGAATTTATCGCAGTCGGACAAGAATGACAAGGTTTTACTCACTGCGATTTTTGATCGTGAGAGTAAGCGCGAACGTATCTTGGAAGCGCGCATGCGCGAGTTCCGCCTGAAGATGCGTCAGGCTGAGGAAGGGAGCCCCATCGCCGTCGCCTCGGAAGTAGACCTGACTGTCGGTGACAAAGACCTGGCCGAAGCCACCGCCGACTACATGCAGCTCGTCAAGAAAGAGCTCGCCGCCATGTAG

Protein sequence:

>DPOGS201832-PA
MEISYQYTKKRCDFGRQALFCEQGPELCDSITTNVAEHKHYILRNPVHEPIQNTPCFSQQFINTIRAEYTTTGANHAEGGWPKDVNVVDPEATQRYRRKIEKDDTYIHCVMSLSPGLEHYVLQNNAIDMYRTYYAEMASLPPIEKSSCRTVNVYRDVAAGGGRPISSICWQSEGGHCFAVTYVDVDFNHIARAPVESYIWDIENANYPISTLLPPSPLLDLQFNPRDQNTLIGGIMNGQVGVWDKRQHGAAAGLCAPHVAHRELVRNVLYINSKSGQEFFSGGPDGACKWWDIRNLNEPTDEMILDVVKSSFDVQTMANANGVSTLEYESTMPTRFMAGTENGFVIGGNRKGKTPMEKLPAKFEAHLGPVWSLERNPGFLKNFLTVGDWTVRLWSEDCRESAVLWSPPHRHKVTAAAWSPTRLSLMTMMQWNGVMAIWDLLRRQHEPVLTMQICEEPLLRVRMHDGGTLAACGSKKGNVYMVELSQNLSQSDKNDKVLLTAIFDRESKRERILEARMREFRLKMRQAEEGSPIAVASEVDLTVGDKDLAEATADYMQLVKKELAAM-