Monarch geneset OGS2.0

DPOGS204855
Transcript: DPOGS204855-TA, 1719 bp
Protein: DPOGS204855-PA, 572 aa
Genomic position: DPSCF300227 + 193677-197334
RNAseq coverage: 1x (Rank: top 93%)
Annotation
Heliconius: HMEL013895, 0.0, 84.65%
Bombyx: BGIBMGA011752-TA, 0.0, 80.19%
Drosophila: CG6053-PB, 4e-154, 47.80%
EBI UniRef50: UniRef50_D6WDC8, 2e-161, 51.13%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WDC8_TRICA
NCBI RefSeq: XP_971718.1, 3e-162, 51.13%, PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
NCBI nr blastp: gi|91092924, 6e-161, 51.13%, PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
NCBI nr blastx: gi|91092924, 2e-158, 51.13%, PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 9.5e-29, protein binding
KEGG pathway: tca:660389, 1e-161
 K11143 (DNAI2), maps-> Huntington's disease
InterPro domain: [167-491] IPR011046, 9.5e-29, WD40 repeat-like-containing domain
 [166-489] IPR015943, 9.8e-26, WD40/YVTN repeat-like-containing domain
Orthology group: MCL10641, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204855-TA
ATGGAGAAAACAGACAAAGCAGAATTCGTCTATGAATATACGAGACGCAGAAGAGAATTTGGTAGACAAACTCTTTTTGAAGACCGAAATGCAGAGCTTAGTGTAAGCATACCTTCAAATCCGTCAATGTACAAACATTATATTTTAAGAAATCCTGTTAATGTGAGCGTTGAAAATACCAAGTCTATGTCCGAGCATTGGGTCAATTCTGTTAGAGCTGAATACACAAGCTCGGGAGTTAACCATGTGGAGGGAGGCTGGCCGAAAGACATCAACATCAACGATCCCGAAGCAACTCAACGATATCGGAGAAAGATAGAGAAAGACGATGCTTACATACACGCAGTCATGCATCTCGGCCACAGTATGGAACATAATATTCTTCAGAACAATGCAGTGGATATGTATCAGATTTACTATTCTGAATTGCCTTCGATACCACCGGTGGAGCGAAGTAGCTGTCATACCGTCAATGTTTATAGGGAACCAGGAGCTAACAGAAGGCCTATTCGTTCTTTATCGTGGCAACAAGACGGAGCCCGACTTGCAACAGCTCACGCTGACATCTGTTTCACACGAAACTCTAGAAATCTTCAGTTATCTTACATTTGGGATATAGAAAATGCAAATGCTCCTGAATTGACAATAACTCCGCCGCAGCCTCTCCTTGATTTGCAGTACAATCCTCGCGATCAACATATTCTTGTCGGAGGTCTTATGAATGGTCAAGTGGGATGTTGGGACATGCGCCGAGGCGGTGAAGTCATTGCTCTGTGTCCACCTCACGTAGCACACAGAGAACTAACGAGAAATGTCCTCTTTATTAATTCAAAAACTGGAGCAGAATTCTTCTCGGCGTCTCCTGATGGTGTCGTAAAGTGGTGGGATACAAGAAACATGAGTGAGCCGACAGATTTTATGATCATAGATCCTGTTAAAACCAACAACGATACTCAGAGTGTTGAAAATGCACTCGGGATTTCTGCCCTCGAATATGAACCGACAATACCTACTCGTTTTATGATCGGTACCGAGACTGGGCTAGTCGTTGGAGGGAATAGAAAAGGAAAAACTCCCTTGGAGAAACTACCTTCAAAGTACGAGGCACATCTTGGTCCTGTATATTCACTGCAAAGAAATCCAACGTTTCTGAAAAATTTTTTAACAGTTGGTGATTGGACAGCGAGAGTTTGGAGTGAGGACTGCAGGGAATCTTCTATACTGTGGACACATTCGCATAGAACTAAATTAACCGATGGGGCATGGAATCCAATAAGGTTCTCTCTAATGCTGGTGTCTCAGTGGGACGGCTGTCTGTCCTGCTGGGACCTGCTCAGACGCCGCACGGCTCCCGTCGTCACAGCACAGCTCTGTGACGAGCCCTTGTTGAGAGTACGACCTCACGAGGGGGGTCTCTTGGTGGCGTGCGGAAGTAGTAAAGGTACCGTTTACTTGGCGGAACTTTCAGAGAACCTCGGCACAGCTGACAAAAATGACAAACAACTTTTGACCCATATCCTAGAACGTGAGAACAAACGCGAGCGTATCCTGGAGGCGCGCTTGCGAGAATTGCGGCTCAAGATGAGGCAGGAGCGAGACGTGGGGCATCCGGACGGAGATTCCGCCCCCACTGACAAAGACCTGGAGGAAGCAGCCAATGAATACTCACTGATGGTGCAACAACTGCAGGAACAACAGAACAACCAGGACTTGGGATAG

Protein sequence:

>DPOGS204855-PA
MEKTDKAEFVYEYTRRRREFGRQTLFEDRNAELSVSIPSNPSMYKHYILRNPVNVSVENTKSMSEHWVNSVRAEYTSSGVNHVEGGWPKDININDPEATQRYRRKIEKDDAYIHAVMHLGHSMEHNILQNNAVDMYQIYYSELPSIPPVERSSCHTVNVYREPGANRRPIRSLSWQQDGARLATAHADICFTRNSRNLQLSYIWDIENANAPELTITPPQPLLDLQYNPRDQHILVGGLMNGQVGCWDMRRGGEVIALCPPHVAHRELTRNVLFINSKTGAEFFSASPDGVVKWWDTRNMSEPTDFMIIDPVKTNNDTQSVENALGISALEYEPTIPTRFMIGTETGLVVGGNRKGKTPLEKLPSKYEAHLGPVYSLQRNPTFLKNFLTVGDWTARVWSEDCRESSILWTHSHRTKLTDGAWNPIRFSLMLVSQWDGCLSCWDLLRRRTAPVVTAQLCDEPLLRVRPHEGGLLVACGSSKGTVYLAELSENLGTADKNDKQLLTHILERENKRERILEARLRELRLKMRQERDVGHPDGDSAPTDKDLEEAANEYSLMVQQLQEQQNNQDLG-
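
The transcript and protein lengths listed in this record are mutually consistent: 1719 bp is 573 codons, which corresponds to 572 amino acids plus one stop codon (the trailing "-" in the protein sequence). The following is a minimal sketch of how the nucleotide sequence could be translated to check this, assuming the standard genetic code applies to this nuclear gene. The names `translate`, `expected_protein_length`, and `CODON_TABLE` are illustrative and not part of the geneset or any database tool.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the standard code (NCBI translation table 1) applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` to match the "-" used at the
    end of the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of amino acids for a CDS that ends in a single stop codon."""
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # First 18 and last 12 bases of DPOGS204855-TA, copied from the record.
    print(translate("ATGGAGAAAACAGACAAA"))
    print(translate("GACTTGGGATAG"))
    print(expected_protein_length(1719))
```

Passing the full DPOGS204855-TA sequence to `translate` and comparing the result with DPOGS204855-PA would be the complete check; only the two ends are shown here to keep the example short.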