Monarch geneset OGS2.0

DPOGS203795
Transcript          DPOGS203795-TA    1905 bp
Protein             DPOGS203795-PA    634 aa
Genomic position    DPSCF300010 + 1601542-1604264
RNAseq coverage     468x (Rank: top 26%)
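The genomic span and the transcript length listed above can be compared with simple arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive, which the record does not state. The names `span_length`, `genomic_bp`, and `non_transcript_bp` are illustrative only.

```python
# Values copied from the record. The 1-based, inclusive coordinate
# convention is an assumption, not something the record states.
START, END = 1601542, 1604264
TRANSCRIPT_BP = 1905


def span_length(start: int, end: int) -> int:
    """Length of an interval under the assumed 1-based, inclusive convention."""
    return end - start + 1


genomic_bp = span_length(START, END)
# Bases inside the genomic span that are absent from the transcript.
non_transcript_bp = genomic_bp - TRANSCRIPT_BP

print(genomic_bp, non_transcript_bp)
```

Under that assumption, the genomic span is longer than the transcript, which would be consistent with intronic sequence, although the record does not list the exon structure.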
Annotation
Source            Hit                 E-value    Identity    Description
Heliconius        HMEL012506          2e-179     54.82%
Bombyx            BGIBMGA003702-TA    0.0        65.16%
Drosophila        CG10286-PA          4e-63      29.15%
EBI UniRef50      UniRef50_F6LXF5     0.0        65.16%      71 kDa protein n=2 Tax=Obtectomera RepID=F6LXF5_BOMMO
NCBI RefSeq       XP_001650042.1      8e-97      34.33%      hypothetical protein AaeL_AAEL004920 [Aedes aegypti]
NCBI nr blastp    gi|350536745        0.0        65.16%      71 kDa protein [Bombyx mori]
NCBI nr blastx    gi|350536745        0.0        65.78%      71 kDa protein [Bombyx mori]
Group
Gene Ontology      GO:0005488    8e-19    binding
KEGG pathway
InterPro domain    [19-225]     IPR016024    8e-19      Armadillo-type fold
                   [499-550]    IPR011989    1.2e-16    Armadillo-like helical
Orthology group    MCL16435    Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203795-TA
ATGGGAAAGGTGAGAAAACGAAAGGTACGACATAATGCACCGGTTAATAATTCTGATGAAGAAGAACTACCCGTAGAGTCTAAAGAGAGTGCAATTCAGACGATTATTGATCAGTTACAGGTATCAAATGTTGAAGAAAAGTATTGTGGTCTGCAAACATTCGCAATGCTCTTAGAATGTCCTGAAAATTTAGAACAAATGATAAAGCGAGGATTAATAAAGATAGTGGCACCACTCCTTCTAGATGCAGCCTCTTCTGTTAGAAATGCTGCAGCTGGTTCATTAAGAAATCTATCTGCTATTAGATTCGATCTGTGTGATGTGTTAATGGAACAAGATGTAATGACATCACTTATATGCTTTTTCCATCAGTTTGCTGAAAATTGGACTCCTGACCCGGCATCAAAGTCAAAGGATGAAGATATTGACACATTTGTTCAATGCACATATTTACTTCTAAACTTATGTGAAAGTTCAGAATTGGCAATGAAATATGTGGGAGATTCAAGAGTACTGGATATATTCTGTAGATACTTGGATTTATCAACTTTTGGTATTGATATAGTTACTGCTATTTTGCAGTGTTTATTTGTCATAGTCGAAGACAATCCCAAAGCAGTTGACAAATTAAAGAACTCATGTGAGCGGCAATTGAGAGAACTCTTGTCAATTGAGGGCAACGACCCTTCAAATTTGTTGATTAAAAGTGTAGCCGGAGGCCTAACAATAAGTCTATGTGGTGGAAATATTGTGACCCTCCCGACAGTTGTATTGAATCAAATAATAGGAATATTAGCTCAGACTCTATCAGTAGATCACCGTTTGGCTTGTAACCAATTGTCTAGTAATGTGCCATTGGGAGATGCGGCTGGGAAAGTTAAGATTCCAGAAGGAAAGGAAGCTATGGTGTTGGAGAAACAAGTTAAATCAGTCATACAAGTTTTAGATGCGCAACAGAGCGCTATTGAGATTATAACCAATATATGTTCTTCAGAAGATCTAGATGAGGTGATGGATGGAATGGAGTCTTCAGATAGTGATGAGCCCACAGAAGACAGTATTGGTGAAGGAACACTTCTCCCTGAAGATAAATTACCCCCTGAACTTTTGGAGGCATTAATATCATTGGAAATATATGACAAAATCTGGGCTCGTACACAGCTACCACCGGAAAATGTTATGTCTATCCTAAAAGAATATGAAGGAACACAATTAGTTTCTAAAAAGTTGTACAGCCTTCAAACTCGATCATTGTTGTGTGTAAATAATATGATAATGACACTGCCTTTAGATAATCTTGGAGGAATAAATGGGGTTTATAAAATTTGGGTTGATGCAGGGAAATTAGTGTTTAAGAATAACTCAAATAACTTTGACATATTAGAATCGGCAACGGCTGTAATGAGAGCCGCTTTAGATAAATTACAAGGATACTATTCAGCTAAAGATAGAAAACTTGACGACAACAATCTATTCAAAGATCTAGCCCTCTCTGATATTGAGATTATGTTGACAGGTATAAGGGATTGTCAAGTTCCTGAAATAAGATCAAACCTCATAAGAATGATTGGAACCATGGCATTGTTGCTGGTGAATAATTTGAATGATATTACAACTAATGTTATCATAACTATAACGGAATTCATAATAGAGCAGGCTCACAAGGAAAATGAAGTATGGGTCCTGGCAGAGGCCATTGACACCATAGTAGATTTATATTCAGAAGATGAAACTGATTCATTGGCAGCAAAAATCAAACTAGGCGACAAATTAAGCATGTTAGCACCAATACTGAAGAATAAGGCTCGGCAACAAAAGAAACTGCCAAAAGAATACAAGGTGCTTGTCGCTACGGCAAATTCAAATTTACCGAGATTTATTAAATATTTGAAAGGACGAACATAG

Protein sequence:

>DPOGS203795-PA
MGKVRKRKVRHNAPVNNSDEEELPVESKESAIQTIIDQLQVSNVEEKYCGLQTFAMLLECPENLEQMIKRGLIKIVAPLLLDAASSVRNAAAGSLRNLSAIRFDLCDVLMEQDVMTSLICFFHQFAENWTPDPASKSKDEDIDTFVQCTYLLLNLCESSELAMKYVGDSRVLDIFCRYLDLSTFGIDIVTAILQCLFVIVEDNPKAVDKLKNSCERQLRELLSIEGNDPSNLLIKSVAGGLTISLCGGNIVTLPTVVLNQIIGILAQTLSVDHRLACNQLSSNVPLGDAAGKVKIPEGKEAMVLEKQVKSVIQVLDAQQSAIEIITNICSSEDLDEVMDGMESSDSDEPTEDSIGEGTLLPEDKLPPELLEALISLEIYDKIWARTQLPPENVMSILKEYEGTQLVSKKLYSLQTRSLLCVNNMIMTLPLDNLGGINGVYKIWVDAGKLVFKNNSNNFDILESATAVMRAALDKLQGYYSAKDRKLDDNNLFKDLALSDIEIMLTGIRDCQVPEIRSNLIRMIGTMALLLVNNLNDITTNVIITITEFIIEQAHKENEVWVLAEAIDTIVDLYSEDETDSLAAKIKLGDKLSMLAPILKNKARQQKKLPKEYKVLVATANSNLPRFIKYLKGRT-
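The nucleotide and protein records above can be cross-checked by translation. The sketch below assumes the standard genetic code and that the transcript is a complete coding sequence read in frame from its first base. It prints the stop codon as `-` to match the record. The helpers `translate` and `expected_cds_length` are illustrative names, and only the first and last few codons of the transcript are used to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_aa: int) -> int:
    """Bases needed to encode protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


# First 11 codons and last 5 codons of DPOGS203795-TA, copied from the record.
head = "ATGGGAAAGGTGAGAAAACGAAAGGTACGACAT"
tail = "AAAGGACGAACATAG"

print(translate(head))           # start of the protein
print(translate(tail))           # end of the protein, including the stop
print(expected_cds_length(634))  # compare with the listed transcript length
```

Running the same `translate` call on the full FASTA sequence should reproduce the whole protein record if the assumptions hold.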