Monarch geneset OGS2.0

DPOGS203925
TranscriptDPOGS203925-TA2220 bp
ProteinDPOGS203925-PA739 aa
Genomic positionDPSCF300005 - 540505-551742
RNAseq coverage189x (Rank: top 48%)
Annotation
HeliconiusHMEL0135032e-5954.04% 
BombyxBGIBMGA002027-TA8e-7162.66% 
DrosophilaBtbVII-PF2e-4666.94% 
EBI UniRef50UniRef50_E1ZXG36e-7742.63%Protein bric-a-brac 2 n=8 Tax=Formicidae RepID=E1ZXG3_CAMFO
NCBI RefSeqXP_001120712.13e-8141.10%PREDICTED: similar to BTB-protein-VII CG11494-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838613243e-10837.32%PREDICTED: uncharacterized protein LOC100879573 isoform 2 [Megachile rotundata]
NCBI nr blastxgi|3838613224e-11537.98%PREDICTED: uncharacterized protein LOC100879573 isoform 1 [Megachile rotundata]
Group
Gene OntologyGO:00055158.1e-25protein binding
GO:00036771.9e-10DNA binding
KEGG pathway 
InterPro domain[71-185] IPR0113335.9e-30BTB/POZ fold
[91-184] IPR0130698.1e-25BTB/POZ
[99-194] IPR0002101e-24BTB/POZ-like
[386-427] IPR0078891.9e-10Helix-turn-helix, Psq
[378-440] IPR0090571.2e-06Homeodomain-like
Orthology groupMCL19697 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203925-TA
ATGTCAAGTCAAACTAAATTCTTAAAGTTCAAATTAGCACTGAAAGCGCCCTATGGCCAAGACAAGAAGAATGACTTCGATATCAGGTTTAGTCATTACTTGGTTCTTATGGTTTCTTACGAAATGTTATTACATTGTCAAGATAAACTGACACTGCTATTCAAAGCTGCCAACATTACTAATAATTTCCTCCAAATTGATGCCAAGATGTCTCAACAGTACAGTCTTCGGTGGAATAACCACCAGCCAAACTTCATTTCAATGTTTGGTAATTTACTTGCTACCAAAGATCTTGTGGATGTGACTTTGGCAGCTGAAGGCCAACACTTGGTTGCACACAAAGTTGTCCTTTCAGCATGTAGTACATACTTTCATTCACTCTTTGTGGATAATCCAACTCACCATCCAATAGTTATTCTTAAAGACATCACATTTAATGATTTACGCACAATGGTAGATTTTATGTATTATGGTGAAGTGAATGTCACGGAGCAGCAACTAGCACAGGTTTTGGAAACAGCAAAAATATTAAAAATAAAGGGGCTGACAGAAATGCCTGACTCTACTTCCTTAACTAGATCTCAGGGAATTCCCACAGATTTCCCAACAACAGAAACTTCAAGTGATACACAAAGGCCTTCAGTGTCACCCTCATCTCCTATTAGAAAGAAAAGGCAGAGAAAGAGTTCATCCGGTTCACTGGCAAATCCTGAAGAGCTCATGCAGAATGTTGATATGATGCGTGATGCAGTAACCTTAGGCAGCCTAAACATACAAAAGAAACGGGAAATGGATGAGAGAATCGAAACTGCACAACAGACCGAGGATGCATCAGCAATTAACATTGATCAACTTGCAATAGACACGAATGCTGTTGGGATGTCTCAAGGAGCCCAGTGGAATATGATGGAGCATCCATATCCCCGGTATACGAACGCGGGTATTGGTGGCGGCGGTTTACAACCGGAACAGGCCATGTATGTGAATAACGTGATACATCAACATGGAGACGACATGAGCCACTACGGACACACGCTGGGATCTACGGGGCCGGTGCCGGGACCCAGTGTACTCCAAGGGATGAATGATCAAGTTGGTCAGGCTCACAGTTCGGGTATGCCTGTAAAGAGACGACGCACCACTAACCCTCAGGCAGAAGAAAATTTTCAAAAAGCCCTGGAAGCGGTCCGCGTTGGTGGGATTGGTTTCTGCAAAGCAGCGCGAATGTTTGGTGTAAATAATCGAACATTGTGGCTCGAATATAAAAAGAAGGGTTATCCGAATAACAGACCAAGTATCAAGAATCGCATAAAGAGAGAACATATCACACCACCTCCCGAAATAAAGGAAGATCCTCCACAATTGGAACAGCAGATGGCACTTCTCTGTCAACCGCCTCCCGTCCCGCTTCACAGGACGCACTTGTTGACAAAACAAAAATCGCACACAACAATTTTTCTTGGAGCGTATATAAGGATAGGTGAAACCGCAAGTTGTGGGTTGTTTGAATATTTTATAGGTAATTTGGCTTCACCGTGTGGGGGACGTCTACGACGTTCAAAATACTTCAAGAGGCAAAATAGAGTTCAACCGGAAGAAAGGGAGGTCAGGAAACAGGTGTCGGAACCGACCCCCTCAACCGGCGCACACATTTTACAGAAACACGAGTCCTATCCACAGTATTCAGCTTCAAAGGGCTGCAACAAAACAACACAGCTGTTAGTATATATTACAGAGTGCTGGTTATCTTATAAAAAGTTTGCAATCAAAATGTTCAAGAATGATAATACTTTCGCCAGCACACATTTCAAAATACTATCATACCATACAAATCGCAAACCAAGGACGGAGCACGCGGTGGGGTCACTGTTGATCCGAGCAGTGGCAGTGGTGGATCGTCAAACGTCGACCAGCAGCACTACAGGCAGCACATCCACAGCAACACATACAGACACTGCGGAGACAGCGCAGACGCAGCGAGCATCACTCACGGACAGCATAGACGGGACACACACGCAGCGAGCTTCACACACGCCCGCAGTGCGTTCCGGACCCCCTCTCTCCTGCAACTTCTGCTGGAACACGGTCGACGAATGCGGTAGAATACTACGACGCAAAACACAATACCACTGTCCGGAATGTCGCACCAATCTCTGTATAGTCCCATGCTTCCATGAGTATCACGACAGTAACGGAGACGTGACGGCCAGCGCCAGGTGA

Protein sequence:

>DPOGS203925-PA
MSSQTKFLKFKLALKAPYGQDKKNDFDIRFSHYLVLMVSYEMLLHCQDKLTLLFKAANITNNFLQIDAKMSQQYSLRWNNHQPNFISMFGNLLATKDLVDVTLAAEGQHLVAHKVVLSACSTYFHSLFVDNPTHHPIVILKDITFNDLRTMVDFMYYGEVNVTEQQLAQVLETAKILKIKGLTEMPDSTSLTRSQGIPTDFPTTETSSDTQRPSVSPSSPIRKKRQRKSSSGSLANPEELMQNVDMMRDAVTLGSLNIQKKREMDERIETAQQTEDASAINIDQLAIDTNAVGMSQGAQWNMMEHPYPRYTNAGIGGGGLQPEQAMYVNNVIHQHGDDMSHYGHTLGSTGPVPGPSVLQGMNDQVGQAHSSGMPVKRRRTTNPQAEENFQKALEAVRVGGIGFCKAARMFGVNNRTLWLEYKKKGYPNNRPSIKNRIKREHITPPPEIKEDPPQLEQQMALLCQPPPVPLHRTHLLTKQKSHTTIFLGAYIRIGETASCGLFEYFIGNLASPCGGRLRRSKYFKRQNRVQPEEREVRKQVSEPTPSTGAHILQKHESYPQYSASKGCNKTTQLLVYITECWLSYKKFAIKMFKNDNTFASTHFKILSYHTNRKPRTEHAVGSLLIRAVAVVDRQTSTSSTTGSTSTATHTDTAETAQTQRASLTDSIDGTHTQRASHTPAVRSGPPLSCNFCWNTVDECGRILRRKTQYHCPECRTNLCIVPCFHEYHDSNGDVTASAR-