Monarch geneset OGS2.0

DPOGS207435
Transcript: DPOGS207435-TA, 1281 bp
Protein: DPOGS207435-PA, 426 aa
Genomic position: DPSCF300051 - 827051-831828
RNAseq coverage: 63x (Rank: top 68%)
Annotation
Heliconius       HMEL012335          7e-130   77.91%
Bombyx           BGIBMGA009907-TA    1e-108   76.29%
Drosophila       br-PK               2e-63    75.97%
EBI UniRef50     UniRef50_O96376     4e-148   72.49%   Broad-complex Z4-isoform n=16 Tax=Obtectomera RepID=O96376_MANSE
NCBI RefSeq      NP_001104804.1      4e-170   78.56%   broad-complex isoform Z1 [Bombyx mori]
NCBI nr blastp   gi|162461636        8e-169   78.56%   broad-complex isoform Z1 [Bombyx mori]
NCBI nr blastx   gi|162461636        0.0      80.23%   broad-complex isoform Z1 [Bombyx mori]
Group
Gene Ontology     GO:0005515         1.1e-24   protein binding
KEGG pathway      dme:Dmel_CG11491   2e-61
                  K02174 (BR-C) maps-> Dorso-ventral axis formation
InterPro domain   [4-117]    IPR011333   4.6e-29   BTB/POZ fold
                  [22-117]   IPR013069   1.1e-24   BTB/POZ
                  [32-127]   IPR000210   1.2e-20   BTB/POZ-like
Orthology group   MCL14745   Insect specific

Nucleotide sequence:

>DPOGS207435-TA
ATGGTCGACACACAGCACTTCTGCCTCCGGTGGAACAATTACCAGAGCAGCATCACCAGCGCCTTCGAGAATCTTCGTGACGATGAAGACTTCGTGGACGTCACCCTGGCCTGTGACGGGAAGAGCCTTAAAGCACACAGGGTCGTTCTATCGGCCTGTAGCCCCTACTTTAGAGAACTTTTAAAGTCGACACCATGCAAGCACCCAGTGATCGTGCTGCAAGACGTGGCCTTCACGGACCTGCACGCGCTCGTGGAGTTCATCTACCACGGCGAGGTGAACGTGCACCAGCGGAGCCTCTCCTCTTTCCTCAAGACAGCGGAGGTCCTTCGCGTCTCCGGACTGACACAGAATGATGACGCCCAGGGGCCGCTAGTACAGAGCATCGCTCGCGCAGCCGCCGCGGCAGCTTCGTCCCCCCACACCCCGCCTCACCCCGCCCACACCCCTCAGAACCCTCACACCCCCAGCTACTCCGAGAAGCTAGAGGAAGCCCTGTTGCACCCGACGATGCGTCGTATCTCCCTCCCCCCGCGCCGCATATCCCGCTCGGCTGACAACTCGCCGGACGTCATCAAACGCGCCCGTCACGACAACAACAACGACCAGGCCCAGGTTCACGACTTCTCCACCAAGAACCATTCGATGAACAACACGCGCGCCCACAACGAACAAGGAAACGGGAACGGCATCTCCAACTCCAGCTCGTCGCCGTCCCCGCGGCTGATGGACGAAGTCAAGAACGAACCGCTCGACATGATCTGCCCCTCCAACCCGGACATAGACAGAAGCACAGACGACACGCCGCCGCATCATCACAGACCACTTGGTGGAGGTCCGCCTTCCCGCGCCAGCTCCGCGGAGGCCGACGACCGCACTCCTCCGCCGCAGCTGCCTCCGTCGTCGTTCATCTCCCCGGCAGACACCAAACTGTTCCCGCCTCACAACTACAACTACAGCATGGCGCTCGCCGATCCATCCGCGCTGGCCGGTCTGCCGAGCCCCCTGGCCCCGGACGGCATGGCGAGTACGTCTCAAGGTGGCGCCCGCACCCCCCAGGAAGAGTACCGATGCGAGCCCTGTAACAAGAGTTTGTCTTCCCTGACGCGGCTCAAACGACATATCCAAAACGTTCACATGCGACCGTCCAGGGAGCCCGTGTGTAACATTTGCCGACGGGTATACTCCAGCCTGAATAGTTTGAGAAACCACAAGTCGATCTACCACCGCAAGCAGCAACCGCCCTCCGCCGGCCAGGGCCCCTTCTACCCCGTCAATTGA

Protein sequence:

>DPOGS207435-PA
MVDTQHFCLRWNNYQSSITSAFENLRDDEDFVDVTLACDGKSLKAHRVVLSACSPYFRELLKSTPCKHPVIVLQDVAFTDLHALVEFIYHGEVNVHQRSLSSFLKTAEVLRVSGLTQNDDAQGPLVQSIARAAAAAASSPHTPPHPAHTPQNPHTPSYSEKLEEALLHPTMRRISLPPRRISRSADNSPDVIKRARHDNNNDQAQVHDFSTKNHSMNNTRAHNEQGNGNGISNSSSSPSPRLMDEVKNEPLDMICPSNPDIDRSTDDTPPHHHRPLGGGPPSRASSAEADDRTPPPQLPPSSFISPADTKLFPPHNYNYSMALADPSALAGLPSPLAPDGMASTSQGGARTPQEEYRCEPCNKSLSSLTRLKRHIQNVHMRPSREPVCNICRRVYSSLNSLRNHKSIYHRKQQPPSAGQGPFYPVN-
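
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and that the trailing "-" in the protein record marks the stop codon; the function name `translate` and the variable names are illustrative and not part of the database. It translates the first and last few codons of DPOGS207435-TA and checks that a 426 aa protein plus one stop codon accounts for the 1281 bp transcript length.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 24 nt and last 12 nt of DPOGS207435-TA, copied from the record above.
first = translate("ATGGTCGACACACAGCACTTCTGC")
last = translate("CCCGTCAATTGA")

# 426 residues plus one stop codon, three nucleotides each.
expected_nt = (426 + 1) * 3

print(first, last, expected_nt)
```

Running the same function on the full nucleotide sequence should reproduce the full protein record, provided the sequence is pasted without line breaks.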