Monarch geneset OGS2.0

DPOGS201089
TranscriptDPOGS201089-TA1131 bp
ProteinDPOGS201089-PA376 aa
Genomic positionDPSCF300185 + 269751-272201
RNAseq coverage491x (Rank: top 25%)
Annotation
HeliconiusHMEL0052150.098.14% 
BombyxBGIBMGA007155-TA0.095.21% 
Drosophilalute-PC2e-8642.74% 
EBI UniRef50UniRef50_Q9BX701e-16172.07%BTB/POZ domain-containing protein 2 n=70 Tax=Bilateria RepID=BTBD2_HUMAN
NCBI RefSeqXP_002740428.18e-15970.67%PREDICTED: BTB (POZ) domain containing 2-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|3343267464e-16372.87%PREDICTED: BTB/POZ domain-containing protein 2-like [Monodelphis domestica]
NCBI nr blastxgi|3343267461e-15772.87%PREDICTED: BTB/POZ domain-containing protein 2-like [Monodelphis domestica]
Group
Gene OntologyGO:00055152.7e-11protein binding
KEGG pathway 
InterPro domain[227-375] IPR0129836.1e-47PHR
[2-68] IPR0113339.4e-14BTB/POZ fold
[74-182] IPR0117053.8e-12BTB/Kelch-associated
[2-68] IPR0130692.7e-11BTB/POZ
Orthology groupMCL16747 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201089-TA
ATGTTTAATGGGGTGCTAGCAACAAAATCTGATGAAGTAGAACTCCCAGATGTTGAACCAGCAGCATTTCTACATTTACTCAAGTTTCTATACTCAGATGAGGTCAGAATTGGGCCAGAAAGCGTCATGACAACCCTGTACACCGCTAAGAAATATGCTGTTGCAGCACTGGAGGAACATTGTGTTGATTTTCTTAAGAGCAATCTAGGTACGGACAATGCATTTCTATTACTGACCCAGGCAAGACTGTTTGATGAACCACAGCTGGCCGCTCTCTGTCTGGAGATGATCGACAAGAACACAACGGACGCACTAAATGCCGAGGGCTTCACTGACATAGACCAAGATACATTGAATGCCGTATTAGAAAGAGACACTTTACGAATCCGCGAAGCGAAAATCTTTGCTGCGGTGCTTCGGTGGTCGGAGGCGGAGTGTATCCGACGACAGCTGCCCGTCACACCCAGCAACCAGAGGATGGTGCTGGGCAGAGCCTTCCACGCTATCAGATTCCCTCTCATGTCAGTGGAGGAGTTTGCGATGGGTCCAGCCCAAAGTGGACTATTGGACGACCGAGAGATAGTCCAATTATTTCTATACTTTACAGTCAATCCCAAACCGAATGTAGGTTTCCTGGACACTCCCCGATGTTGCATGACCGGTAAGGAATTGACCGTGAACAGGTTCTCGCAGACTGAATCTCGTTGGGGCTACAGTGGAACAACTGATAGGGTCAGATTTACAGTGGATCAGAGAATTTTTGTCGTCGGTTTTGGGCTGTATGGATCGTATTTCGGACCTTCGGAATATGAAGTGCACTTACAGATAATTCACCTGGCCACCAAGAAGGTGTGCGGCTCCAACACGACCACGTTCTGTTGTGACGGCTCCGACGACACCTTCCGCGCTATGTTCAAGGAACCGGTCGAGATACTCCCTAACACCTCGTACATAGCCAGCGCTAAGCTCAAGGGCACCGACTCGTACTACGGTACTCGCGGCTTGAGGCGAGTCACGGCTGACTGTAACAACGGGGAGAAGGTGGTTTTCCAATTCTCATACGCGGCTGGAAATAACAATGGGACCTCGGTGGAAGACGGACAGATACCGGCCATCATATTCTATATATAA

Protein sequence:

>DPOGS201089-PA
MFNGVLATKSDEVELPDVEPAAFLHLLKFLYSDEVRIGPESVMTTLYTAKKYAVAALEEHCVDFLKSNLGTDNAFLLLTQARLFDEPQLAALCLEMIDKNTTDALNAEGFTDIDQDTLNAVLERDTLRIREAKIFAAVLRWSEAECIRRQLPVTPSNQRMVLGRAFHAIRFPLMSVEEFAMGPAQSGLLDDREIVQLFLYFTVNPKPNVGFLDTPRCCMTGKELTVNRFSQTESRWGYSGTTDRVRFTVDQRIFVVGFGLYGSYFGPSEYEVHLQIIHLATKKVCGSNTTTFCCDGSDDTFRAMFKEPVEILPNTSYIASAKLKGTDSYYGTRGLRRVTADCNNGEKVVFQFSYAAGNNNGTSVEDGQIPAIIFYI-