Monarch geneset OGS2.0

DPOGS214572
Transcript: DPOGS214572-TA (1608 bp)
Protein: DPOGS214572-PA (535 aa)
Genomic position: DPSCF300050 - 695877-698560
RNAseq coverage: 559x (Rank: top 23%)
Annotation
Source          Hit                     E-value  Identity  Description
Heliconius      HMEL009108              0.0      92.05%
Bombyx          BGIBMGA005139-TA        0.0      91.79%
Drosophila      lute-PC                 0.0      77.53%
EBI UniRef50    UniRef50_UPI00020627CA  0.0      75.94%    UPI00020627CA related cluster n=3 Tax=unknown RepID=UPI00020627CA
NCBI RefSeq     NP_001040171.1          0.0      91.60%    BTB/POZ domain containing protein [Bombyx mori]
NCBI nr blastp  gi|308390271            0.0      92.15%    BTB domain-containing protein [Helicoverpa armigera]
NCBI nr blastx  gi|308390271            0.0      92.15%    BTB domain-containing protein [Helicoverpa armigera]
Group:
Gene Ontology: GO:0005515  4.5e-25  protein binding
KEGG pathway:
InterPro domain:
[374-519]  IPR012983  1.4e-48  PHR
[94-217]   IPR011333  7.8e-33  BTB/POZ fold
[118-218]  IPR000210  4.5e-25  BTB/POZ-like
[111-217]  IPR013069  1e-23    BTB/POZ
[224-333]  IPR011705  4.3e-15  BTB/Kelch-associated
Orthology group: MCL11369  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS214572-TA
ATGTCCAATTTATGTTCAAGGATTGTGTCAAAACCTCCGAAAAGATTAACTGAGAATAGAGATAGTATGTCAGTCGCACAAACTAACACTTGGATGAATGCGGAAAACATAAACAATGGCGGTGGTTTGTCGTTATCACCCCCACATACAATTTCTCAACGTGAAACGGCAATGCAGGTGTCACAATGTTATAGTGGCCCACCATCACCATGTGGTTCATCAAGTTCAACCCCCTCACCTACGTTATCCCCAGCACCACCCCCAGGGACAGCAACCCTAGATCCTAATTGGCAAGCTACAAAGCCTACAATTAGAGAAAGAAATGCAGCTATGTTTAACAATCAGCTAATGGCTGATATCACGTTCATTGTTGGAAGTCCTGGACATACACAAGTTATACCAGCTCATAAATATGTATTAGCAACAGGGAGTTCAGTATTCTATGCAATGTTCTATGGTGGTCTTGCTGAATGTAAACAAGAGATAGAAGTGCCAGATGTTGAACCATCAGCCTTCCTCGCATTACTTAAGTATTTATATTGTGACGAGATTCAATTAGAAGCTGATACAGTATTATCAACATTATATGTTGCCAAGAAATATATAGTTCCTCATTTGGCAAGGGCGTGTGTGAACTATTTGGAAACTAGTCTGACTGCAAAAAATGCATGTTTGCTTTTAAGCCAATCCAGATTATTTGAGGAACCTGAATTAATGCAGAGGTGTTGGGAAGTAATTGATGCCCAGGCAGAGATGGCTCTGACATCTGAAGGCTTTGTTGACATAGATGTTTCTACTCTTGAATCGGTATTGGCAAGAGAAACACTAAATTGCAAAGAAATTAACTTATTTGAAGCTGCCTTAGCTTGGGCACAAGCTGAATGTGTACGAAGAGAAATTGATGCCACTCCAGTAAACAAAAGATCTATGCTAGGGAGCGCCATATTTCTGATCAGATTTCCGACAATGACATTGGAAGAGTTTGCTAATAGTGCTGCTCAACTGGGTATTCTCACACCACAAGAAACTATAGACATATTTCTACATTTTACTGCAGCTAGCAAACCACAACTGTCCTACCCCATTAAAGCCAGGGCGGGACTTAAGGCTCAGATTTGCCACAGATTCCAATCTTGTGCCTACAGAAGTAACCAATGGAGATATCGAGGTAGATGTGACTCAATACAATTTTGTGTTGACAAGAGAATATTTGTTGTTGGATTTGGACTTTATGGTTCATCAAATGGTGCAGCAGATTATAATGTTAAGATTGAGTTAAAGCGACTCGGAAGGGTCTTGGCTGAGAATAACACAAAGTTCTTCTCCGATGGATCAAGCAACACTTTTCATGTGTACTTTGAGAATCCAATACAAATTGAGCCAGAATGTTTTTACACTGCTTCTGCTATACTCGATGGCAGCGAGCTGAGTTATTTCGGCCAAGAAGGTCTAAGTGAAGTATATATGGGAACTGTGACTTTCCAGTTTCACTGTTCATCAGAAAGTACAAATGGAACTGGTGTCCAAGGAGGTCAGATTCCAGAGCTAATTTACTACGGGCCAACTATTAACAACTCTATAGCCAATACAAACTCTAATGAGGACTGA

Protein sequence:

>DPOGS214572-PA
MSNLCSRIVSKPPKRLTENRDSMSVAQTNTWMNAENINNGGGLSLSPPHTISQRETAMQVSQCYSGPPSPCGSSSSTPSPTLSPAPPPGTATLDPNWQATKPTIRERNAAMFNNQLMADITFIVGSPGHTQVIPAHKYVLATGSSVFYAMFYGGLAECKQEIEVPDVEPSAFLALLKYLYCDEIQLEADTVLSTLYVAKKYIVPHLARACVNYLETSLTAKNACLLLSQSRLFEEPELMQRCWEVIDAQAEMALTSEGFVDIDVSTLESVLARETLNCKEINLFEAALAWAQAECVRREIDATPVNKRSMLGSAIFLIRFPTMTLEEFANSAAQLGILTPQETIDIFLHFTAASKPQLSYPIKARAGLKAQICHRFQSCAYRSNQWRYRGRCDSIQFCVDKRIFVVGFGLYGSSNGAADYNVKIELKRLGRVLAENNTKFFSDGSSNTFHVYFENPIQIEPECFYTASAILDGSELSYFGQEGLSEVYMGTVTFQFHCSSESTNGTGVQGGQIPELIYYGPTINNSIANTNSNED-
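
The lengths listed in this record are consistent with a complete open reading frame: 1608 bp corresponds to 536 codons, that is, 535 amino acids plus one stop codon (shown as the trailing "-" in the protein sequence). A minimal sketch of that consistency check follows. The function names are hypothetical and are not part of the geneset or any tool; the numbers and the short sequence fragments are copied from the record above.

```python
# Hypothetical helper names for illustration only; the lengths and the
# sequence fragments are taken from the DPOGS214572 record above.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def orf_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def has_start_and_stop(cds_start: str, cds_end: str) -> bool:
    """True if the CDS begins with ATG and ends with a standard stop codon."""
    return cds_start[:3] == "ATG" and cds_end[-3:] in STOP_CODONS


# Transcript DPOGS214572-TA is listed as 1608 bp, protein DPOGS214572-PA as 535 aa.
print(orf_length_matches(1608, 535))

# First 12 and last 12 nucleotides of DPOGS214572-TA.
print(has_start_and_stop("ATGTCCAATTTA", "AATGAGGACTGA"))
```

This only checks the listed lengths and the terminal codons; it does not translate the full sequence or account for the introns implied by the longer genomic span.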