Monarch geneset OGS2.0

DPOGS213589
TranscriptDPOGS213589-TA3411 bp
ProteinDPOGS213589-PA1136 aa
Genomic positionDPSCF300033 + 473788-483990
RNAseq coverage260x (Rank: top 41%)
Annotation
HeliconiusHMEL0054771e-9456.45% 
BombyxBGIBMGA011657-TA0.066.41% 
Drosophila% 
EBI UniRef50UniRef50_D6WGG11e-9328.80%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WGG1_TRICA
NCBI RefSeqXP_969177.12e-9128.42%PREDICTED: hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|3320240135e-10029.48%LisH domain and HEAT repeat-containing protein [Acromyrmex echinatior]
NCBI nr blastxgi|3320240136e-10329.07%LisH domain and HEAT repeat-containing protein [Acromyrmex echinatior]
Group
Gene OntologyGO:00054881.7e-16binding
KEGG pathway 
InterPro domain[510-1093] IPR0160241.7e-16Armadillo-type fold
[863-1076] IPR0119892.2e-12Armadillo-like helical
Orthology groupMCL17019 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213589-TA
ATGAATCCGTATAAAGATGTAGAAGAGAGTTCTTTCGTCGCGGCACCACATCTTACATATGAAGATATCGCTACTAAGTTGCTTAAGGATAACTTATTTTTAACTGCCCTAGAACTTCATACGGAACTTGTGGAAAGTGGAAAAGAGTTACCTCAGCTGAGAGAATTCTTTTCAAATCCTGGAAATTTCGAACAACATGTTTCACGCGCGTCAGAAATGGGAACTATTAATCGGACTCCAAGTTTAGCGACATTGGATTCGCTGGATACAGCAAGGTATTCTGAGGACGGAGGTGGCGATCGAGCTGGTAGTGGTTGTGATGTTGCTGTTTTAGAATTTGAACTTAGAAAAGCTAGAGAAACAATAAACTCTTTACGAGCTAACCTAACACAGTTTGCTGATGAATCACCTCTTGACAAGAACAATTCTGAAATCGATAGCCAAAGGACACTTAAACCACATGAGAAAAAAGCTCTCAATTTTCTTATAAATGAATATCTTCTTCTTCACAATTATAAACTAACGAGCATTACATTTTCTGATGAAAATCCAGATCAGGAATTTGAGGATTGGGATGATGTAGGTTTAAATATTCCCCGACCTGCAAACCTTATGTCTCTATTCTGGGGAAGTACACGCAGCTTAAGTGTACCGAAGACAGATGTTGCAACATACACAGATTTCTCATGCATAGACAGTGAATGCCAAACGGATTTGGATGAAAATGTGTGTGTGAGTTGTCAGACTTTAGACCATGACACAGATTGGAGCCATGAGCTTCTAATTCAGGTGGAAGAAATAGAACTATTGAAACAAAAAATAATAGCTCTCGAAACGGAAAAATTAAATTTCCAAAAACTATATGATGCTGCCATTGTTAGTCTCAACACGCTAACAAGTCCGATGTCCGAGAGTAAAACACTGGAATTACAAATACCAGATGAAAAAATTAATTCGAATTTAAAAGATCACTTAGAAGATATAAAACCTATTACGTGCATGGCTTTGGAAAACCATTATGGCAGCGGTAATTCAACTCACAGTGCTACGCCTGAACAATTTGAGATGATCTATGGTGATAAGAATAATTGCATGTCAAAGAAGGACGGTTCAAACACCAGTTCGTTTGAGCCGGCTGTATTGGACAGTGTCCGAGGGTCGCCGAGACGAGCCAGTGTTACCACATTGGATGAAACTCTCAGCATCAACGATGCTGGCGAATGGACCAGGGTTCACTATGAATATAATACTGTAGACAATAACAAAGAAATGTGGATCGACAGTGGTATACCGAGTGCCTTGAAGGATATGATAATGGGTTGGTGTAACGAGGCGTTAGGTACAAACGGTCCAATAAACAATGATTTACTTTTGGATCTCGTTAATAGTGAGAAGACAATCACACTGTCAGGATTATTACCACTTGTCGCGGACACGCTGCCGAGGGTGCTCCCCCATACCTTAGTGTCTCGTCGCGGCGAGGCCGCAGCCCTAGTAGCTGGTGCAGCAGCCCTACTATCTCCAGGTGACGCCAGGCGATCCAGACTCCTGCATACACTTCTCACACTATACAAGAAACCTGATCCCGAGGACGCGAAGATCATATGCGAAGCAACACGTCTCGTGGTGAAGTGGGGCGGTAGTGGGGAGGTTCTGTCCTCTATAGCTGAGCTGCTAGGTTCAAGGTCGTCCGAACGAAGAGTTCTAGCCAGTCAGATCTGCTTGGCAATAGCGCCTTATGTGCCGATAGAGCTGTGCACCTCACTGCTACTGAGTTTGGTGATGCTGATGAGCGAGTCCAGTGAAATAGAAGTGAGAAACATCGGGCTGAGAGCAGCCGTCCTCATATGTCCAGTGGCGGAACACAAGTATGGACAGTTGGAGGATTGCATGTTTAACTTCCTGAGAGATAAAGACGAGAAGATAGTTAAGGATACAGTGAACGTGTTTGTACCTGTCCTAGCGAGGAGTGCCATCATATCTGGTAAATTCTCAACGGATTTATTCAGCAAGGTGCTAGCGAATTTGAACAAATCTGGCTCAGACAATGAACGGAGGACAATGATCATGTATTTGGAGGTTCTGCAGTCGTTGGCATCCTCCGAACTGGTGTACGTGACCAATGTACAACTGGTCAGGGATGTGAATTGTGATATTGTCATGTCCGAAGTACCGTTGAGTGATCAAATAGATGTATATAACATGAGTGATTCTGATAGAGTGTTGTGTGTGGCAATGAACCGCCTGCTGAAGGAAGATCCCCATACCAGATGGGCCGAACTAAATTGGTTCATTGATGTTACAAAACAAATTTTAGATATAGGAATTAAATACAAAGCGTTGAACCATCCGGCGGTGTATGAAACACTTATAACATTATTCCATACATATGTTGATAAGTTCGGACATGACTTCACGGCTGCTGTACTCAGCGGCGTGTTTACTGAGATTATATTGGATTTAGAGAATAAATTAGAGAAATTACACGCCATTAGTGTGGACAGTATGGTCGTTGTGGGCATATACCTGGCGACTGTTTTGATTGAAGTAGAAAGTGTCGATCAACAAGCAGAGTTCTTACAGAAATGGACTATGTATAGCAGTATAAGGGGTTTGCCACCTAAAATATTATCGATTCCCTTGAAATGGTTGTCTCAACAGAGGCCGAGCACACTCAACACCTACATACATCATTTACGAGAATTTGCAGCGAGCAGTTGTGACTCATCAAGCGGCAGTACTATACGGATGTTCATAGCTAACCTTATAACGGAGTTTGTAAACACGACGGATGTCAACGAGGACTGTATCCACAACCAACTGTTGCCGGCCGTCATCGCACTGCTCTATGATGATGACGTATCTGTCCGCGAGGCGGCGATAACGTCGTGGGGCAGCGTTTCAAGGTCGTGTGTGAGTCGAGGGTTGTCCTGCTCAAAAAACTGCTGGCCGGCCTTCGAGGAGGTCGTACGGAGACCGCTGGTCGGCAGGGAGTTAGCGCGCGCAGCCGAGGCACTCGCTATGCTATTACTACCAATAGACGGACAAACAGTTTGCGAAAAAGCGGTGTCGGTGCTGTGCTCGTTGTCTATGAGCGTGTCTATGGTCGATAGTGAAGTGATGTCGTCTCTGGCACCGGCGCTCCAGTTAGCGTCGCATCACTGTCCCCAGCACCCAGCACTACTACCAGCTCTCAGGAAATTAGAGGAAATCGTCCAATCACCTTCTATGGCCCAATACAAGCCAGCAATAGAAGCCCTTCTCCACGTGGCCGGTACTGAGGCTGTAGACAGTTCCCCTCGGTCCAGCAACCTTCATACAGCGCAGGAAGTTGGCAGAAGAGTCACTCAGATCTTCCAGCAATCCAAAACCAACATAAACCTTCCAAATATATTTAGAAAGAAAACTTAG

Protein sequence:

>DPOGS213589-PA
MNPYKDVEESSFVAAPHLTYEDIATKLLKDNLFLTALELHTELVESGKELPQLREFFSNPGNFEQHVSRASEMGTINRTPSLATLDSLDTARYSEDGGGDRAGSGCDVAVLEFELRKARETINSLRANLTQFADESPLDKNNSEIDSQRTLKPHEKKALNFLINEYLLLHNYKLTSITFSDENPDQEFEDWDDVGLNIPRPANLMSLFWGSTRSLSVPKTDVATYTDFSCIDSECQTDLDENVCVSCQTLDHDTDWSHELLIQVEEIELLKQKIIALETEKLNFQKLYDAAIVSLNTLTSPMSESKTLELQIPDEKINSNLKDHLEDIKPITCMALENHYGSGNSTHSATPEQFEMIYGDKNNCMSKKDGSNTSSFEPAVLDSVRGSPRRASVTTLDETLSINDAGEWTRVHYEYNTVDNNKEMWIDSGIPSALKDMIMGWCNEALGTNGPINNDLLLDLVNSEKTITLSGLLPLVADTLPRVLPHTLVSRRGEAAALVAGAAALLSPGDARRSRLLHTLLTLYKKPDPEDAKIICEATRLVVKWGGSGEVLSSIAELLGSRSSERRVLASQICLAIAPYVPIELCTSLLLSLVMLMSESSEIEVRNIGLRAAVLICPVAEHKYGQLEDCMFNFLRDKDEKIVKDTVNVFVPVLARSAIISGKFSTDLFSKVLANLNKSGSDNERRTMIMYLEVLQSLASSELVYVTNVQLVRDVNCDIVMSEVPLSDQIDVYNMSDSDRVLCVAMNRLLKEDPHTRWAELNWFIDVTKQILDIGIKYKALNHPAVYETLITLFHTYVDKFGHDFTAAVLSGVFTEIILDLENKLEKLHAISVDSMVVVGIYLATVLIEVESVDQQAEFLQKWTMYSSIRGLPPKILSIPLKWLSQQRPSTLNTYIHHLREFAASSCDSSSGSTIRMFIANLITEFVNTTDVNEDCIHNQLLPAVIALLYDDDVSVREAAITSWGSVSRSCVSRGLSCSKNCWPAFEEVVRRPLVGRELARAAEALAMLLLPIDGQTVCEKAVSVLCSLSMSVSMVDSEVMSSLAPALQLASHHCPQHPALLPALRKLEEIVQSPSMAQYKPAIEALLHVAGTEAVDSSPRSSNLHTAQEVGRRVTQIFQQSKTNINLPNIFRKKT-