Monarch geneset OGS2.0

DPOGS212156
TranscriptDPOGS212156-TA1341 bp
ProteinDPOGS212156-PA446 aa
Genomic positionDPSCF300038 + 580892-584807
RNAseq coverage202x (Rank: top 47%)
Annotation
HeliconiusHMEL0125363e-16290.48% 
BombyxBGIBMGA006736-TA3e-12569.97% 
Drosophilabab2-PA3e-4157.63% 
EBI UniRef50UniRef50_UPI00020647504e-5478.86%UPI0002064750 related cluster n=3 Tax=unknown RepID=UPI0002064750
NCBI RefSeqXP_973130.19e-5553.40%PREDICTED: similar to broadZ1 [Tribolium castaneum]
NCBI nr blastpgi|3287904972e-5378.86%PREDICTED: hypothetical protein LOC412161 [Apis mellifera]
NCBI nr blastxgi|910913808e-5455.02%PREDICTED: similar to broadZ1 [Tribolium castaneum]
Group
Gene OntologyGO:00055152e-26protein binding
KEGG pathwaydme:Dmel_CG114913e-38 
 K02174 (BR-C)maps-> Dorso-ventral axis formation
InterPro domain[9-122] IPR0113333.7e-30BTB/POZ fold
[27-122] IPR0130692e-26BTB/POZ
[37-131] IPR0002108.2e-23BTB/POZ-like
Orthology groupMCL25271 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212156-TA
ATGGAAGGTACAGAACTGGGTGGCGACCAGCAATTCTGTTTGAGATGGAACAATTTTCAAGCAAATATAACTTCTCAATTCGAGGCACTGCGCGATGATGAGGACTTCGTTGATGTCACCCTGGCATGCGAGGGACACAGGCTCGAAGCCCACAAAGTGGTTCTATCGGCCTGTAGTCCTTATTTCAAGGAATTATTTAAAAACAATCCATGCCCTCATCCGATAATATTCATGCGTGACTGTGAGGTGTCGCACGTGCGGGCGCTCTTACAGTTTATGTACGTGGGCCAGGTGAACATCGCGCAGGCACAGCTCAGCGCCTTCCTCAGAACAGCTGACGCGCTGCAGATCCGCGGCCTCACTGACTGCTCGCAACACAATGACAAAAAAGTTAACAGAAAGTCGCCACCCTCGCAACTACGTAATTTGCTCAGCGCCAAGCCGTCACATTCTACCTCCTCCTCCAAAGCCGCAAGCCAAAACGTGGAATCGACGTGTGCCGATGACCTCGAGAAGGGTGCGAATCGCCGCTCCGAGATAAATTCTCCCGATGCTGCCAGAAATAATCTCAACGAAGAAGCTTCCTTTCAAACTTCTAGTCAAACGAGGCCTAATAATGACGATTTAAACCAAACTTCAAACTACCCCACATTGAGGGTGAAAACTGATCTCGAAGCAACTGAAATTAATAATGAAGAAAACGATATTCCAATGGACCCCGGGGACTCTGAAGAAGCGGACAAGTGTCAAGATTTCACAGCCACGGACCTTCTCGAACCTAAGATGGAAGTCATGGAGCAAGAGGTCAGCGACGAGGAGCGTTCCAACTTCCAATCATACTTCAATGAAAACAATGCTCTAGCCAATCCGAACCCTTTCGCCACATTACAAGGTAATATCGATCTCATGGCCGGAATGAATTCGGAGCTCCGGGACGAAAACGCGGAAGCGGTGTGGGAGGTGGTCGGGTGGTTCTGGCGTGTGGGGGTGGGGGGTGGTGGTGGCGGGCGCAGTGCGAACGTGGTGTTGTTGGCAGTGTCGCCTGATCCACGAGGCCCAGCCGGTGGTGTGCTCCACTTGCGGCCGCTGCTTCAAGACGCCGCTGTACCTCCGGCGACACACGCTGGCGCAGCACCCCGCGCACCCCGCGCGACTCAAGCCGCCGCCGCCGCCGCTGCCCGCGTTGCGGCCCGCGCACCACTAGCGCACGCGCTCTCTTCTAGTCCATTCCTCACTCCGCTGTATCTCTCTCTATCTCTGTCTCTCTCCCACATCGAGGGCCACTCGCATCGCATACCTCATGTACATTATTATTTCCCCCTCCCCCATTTGGCGCATTAA

Protein sequence:

>DPOGS212156-PA
MEGTELGGDQQFCLRWNNFQANITSQFEALRDDEDFVDVTLACEGHRLEAHKVVLSACSPYFKELFKNNPCPHPIIFMRDCEVSHVRALLQFMYVGQVNIAQAQLSAFLRTADALQIRGLTDCSQHNDKKVNRKSPPSQLRNLLSAKPSHSTSSSKAASQNVESTCADDLEKGANRRSEINSPDAARNNLNEEASFQTSSQTRPNNDDLNQTSNYPTLRVKTDLEATEINNEENDIPMDPGDSEEADKCQDFTATDLLEPKMEVMEQEVSDEERSNFQSYFNENNALANPNPFATLQGNIDLMAGMNSELRDENAEAVWEVVGWFWRVGVGGGGGGRSANVVLLAVSPDPRGPAGGVLHLRPLLQDAAVPPATHAGAAPRAPRATQAAAAAAARVAARAPLAHALSSSPFLTPLYLSLSLSLSHIEGHSHRIPHVHYYFPLPHLAH-