Monarch geneset OGS2.0

DPOGS212556
TranscriptDPOGS212556-TA1719 bp
ProteinDPOGS212556-PA572 aa
Genomic positionDPSCF300075 - 507344-509062
RNAseq coverage29x (Rank: top 76%)
Annotation
HeliconiusHMEL0120790.067.14% 
BombyxBGIBMGA012277-TA0.065.09% 
DrosophilaCG10383-PA3e-6932.73% 
EBI UniRef50UniRef50_UPI00021A6FD91e-8237.01%UPI00021A6FD9 related cluster n=4 Tax=unknown RepID=UPI00021A6FD9
NCBI RefSeqXP_974168.16e-9035.50%PREDICTED: similar to CG10383 CG10383-PA [Tribolium castaneum]
NCBI nr blastpgi|910779301e-8835.50%PREDICTED: similar to CG10383 CG10383-PA [Tribolium castaneum]
NCBI nr blastxgi|910779309e-8835.31%PREDICTED: similar to CG10383 CG10383-PA [Tribolium castaneum]
Group
Gene OntologyGO:00054888.2e-07binding
KEGG pathway 
InterPro domain[40-151] IPR0160248.2e-07Armadillo-type fold
[66-153] IPR0119892.9e-06Armadillo-like helical
Orthology groupMCL13000 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212556-TA
ATGATAAGTCATGTTCAAGCTTTGAATCCACATAGTTGTATTCAGTACTTTCTATCAAAATATTTTCTAAATATTCAAGATCAGATGATTGAAGTGGATAGTGTTCCAGCAAAACCTGATAATATTAATGAGAAACAACTTTGTATTTTATGTCTTGATGCATTACATCATCATCTGTTTTTATTTCATGATAGTACTTCTGATGAAAACTCTTCTGTTGCTGCTTTGAAAGAATTAGGTATTTTTCCAAAATTGGCAGAATTAATGTTGAGATACAGGAATGACTCTGAAATTGATTTTGCAGTTCTCAAGTTACTGACAGTTCTTAGTGTGTATCCTAGCCTACTAAAAGATTTTTATGAGAATGGTCTTATAGGAGAGCTTGCTCGACTAATAAAGTCACGTGATATTAGATTGGCAAGTTCATCAGCTGTTTGTTTAGCAAATTTATCAGGTGAATACCATTATAAGCCAGGTTTATATTTGCTCCATCCGTTATATCGTACCAAAAGTAATCATAATTGTGATATATTGCTTGTTCATGGTTTGAGAGGAGGTGTATTTGTTACTTGGAGACAAAGAGATAAAAAATGTGCAGAACCTGTTGGTATTATTGAAGTAACCGTCTCTGATGTTGAATGTGATCCATGTGAAAAGGACAACCTCTCCAAAACATATTTTGATCCTGAACTTCAACAAGTTCTAGAGGATTTGAAGGAATTGGATGATGAAGCTTTACTGACAAATTTAGAAGTAGTTTTACATGATATTCCTATATCTGCAAAGAGGGAACCTAGCAGTACCAAACAATATACATCAAATAAAAAACGTCTGGCTTTAATACAGGAAGATGAAGATAAATGTAATTATACACATTGTTGGCCTAAAGATTGGCTACCACAAGATTGTGATAGTTTGAGAATTCTTGGTTTCAACTACTGGAGTAAACTTTCTGAATGGCTAGAGGGGTGCCCTCTAAAAAATGCGGATATTGAATCGAGAGCTGAAGAACTTGGCTCTGTTTTAATTGATGCTGAAGTTGGTAAAAAAAGTATTGTTTGGCTGGCTCATTCAATGGGTGGTCTTATAGTAAAAAAATTATTGGTAGATGCAGCACAAAAGAGTGAACCCAGATTTGGAAATCTGTGTCAAAAAACTAAAGCAGTGTTATTTTACAGCACTCCACATAAGGGAAGTGCCTTAGCAACAATGCCTCGAGCTGCAGCAGCAATTCTTTGGCCCTCACAAGATGTTAAACAATTACAAGAAAATTCACCAGTTCTACTTAAAATGCATAATGAGTTTATAAAATTTGCAGATTTGTTCAACTGGGAAACGATAAGTTTTGCAGAAACACAACCAACTTTAGTAACAGCTTTCAAAGTTCCTGTTCATTTTGTAGAATCATTTTCAGCTGACTTAGGTCGTGGTGTATTCTATGAATTGCCTCTTGACCATCTGTCAATATGCAAACCAGCTACAAGGCAATCAATTCTATATACAACAGTATTAGATGTTTTATTAAGAGTGTCATCTTGTGAAGTTCAATTAAAACACACGGACTCACACATTTTAAGACTAATATACACTTTTTGGTCTCTTGTAAAAAGTAAATTTGGTTTGTTAAAAACTAATTTGGAAGATAATCAAGGAAATGGAAATCAACACTGGATTGAAAAAACTTTACTTCATGCATTCACTGATGACGTTATAACATAA

Protein sequence:

>DPOGS212556-PA
MISHVQALNPHSCIQYFLSKYFLNIQDQMIEVDSVPAKPDNINEKQLCILCLDALHHHLFLFHDSTSDENSSVAALKELGIFPKLAELMLRYRNDSEIDFAVLKLLTVLSVYPSLLKDFYENGLIGELARLIKSRDIRLASSSAVCLANLSGEYHYKPGLYLLHPLYRTKSNHNCDILLVHGLRGGVFVTWRQRDKKCAEPVGIIEVTVSDVECDPCEKDNLSKTYFDPELQQVLEDLKELDDEALLTNLEVVLHDIPISAKREPSSTKQYTSNKKRLALIQEDEDKCNYTHCWPKDWLPQDCDSLRILGFNYWSKLSEWLEGCPLKNADIESRAEELGSVLIDAEVGKKSIVWLAHSMGGLIVKKLLVDAAQKSEPRFGNLCQKTKAVLFYSTPHKGSALATMPRAAAAILWPSQDVKQLQENSPVLLKMHNEFIKFADLFNWETISFAETQPTLVTAFKVPVHFVESFSADLGRGVFYELPLDHLSICKPATRQSILYTTVLDVLLRVSSCEVQLKHTDSHILRLIYTFWSLVKSKFGLLKTNLEDNQGNGNQHWIEKTLLHAFTDDVIT-