Monarch geneset OGS2.0

DPOGS208236
TranscriptDPOGS208236-TA1341 bp
ProteinDPOGS208236-PA446 aa
Genomic positionDPSCF300079 - 326075-337070
RNAseq coverage8457x (Rank: top 2%)
Annotation
HeliconiusHMEL0082122e-15197.89% 
BombyxBGIBMGA006426-TA2e-15590.49% 
Drosophilamod(mdg4)-PE2e-4770.59% 
EBI UniRef50UniRef50_Q6IE022e-15681.66%Mod(Mdg4)-heS00531 n=1 Tax=Bombyx mori RepID=Q6IE02_BOMMO
NCBI RefSeqNP_001106229.14e-15781.66%Mod(mdg4)-heS00531 [Bombyx mori]
NCBI nr blastpgi|1638386928e-15681.66%Mod(mdg4)-heS00531 [Bombyx mori]
NCBI nr blastxgi|1638386929e-15781.66%Mod(mdg4)-heS00531 [Bombyx mori]
Group
Gene OntologyGO:00055155.7e-23protein binding
KEGG pathway 
InterPro domain[4-115] IPR0113334.9e-28BTB/POZ fold
[24-116] IPR0130695.7e-23BTB/POZ
[32-127] IPR0002103.2e-20BTB/POZ-like
[287-347] IPR0075881.9e-08Zinc finger, FLYWCH-type
Orthology groupMCL20228 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208236-TA
ATGGCGTCGGACGAACAGTTCTCGCTATGTTGGAACAATTTCCATGCGAATATGTCAGCAGGCTTTCATGGCCTGCTGTCGCGTGGAGATTTAGTAGATGTAACATTGGCTGCCGAAGGTAGATTACTACAGGCACACAAATTAGTTTTATCAGTATGTTCTCCCTATTTTCAAGAAATGTTCAAAATGAACCCCACTCAACATCCCATAGTATTTTTAAAAGATGTTAGTCATTCAGCGCTTAGAGACTTATTACAGTTTATGTACCAAGGTGAAGTTAATGTTAAGCAAGAGGAATTAGCGTCGTTTATTAGTACCGCGGAACAACTTCAAGTTAAAGGTTTAACCGGTAATCAAAATGAAGAAAGTTCTACGCCATCCAAACCAAAGCCGACTTCGAGGCCAGGCCCGAGGTCGTCACAACAAAGGCAATCTGTTATGACTAAGTTAGAGACTGATTTAGATTCTAAGCCCTCCTCAACTCCAGTAGCAATTAAGAGACCAAATAGGCCATCAATAGCATCAAATAATTCGTCGTCATCTCAAAGTGGACCTGCGAAAAGAAAATGTGTGGACCCCTTAGAAGCCGGCCCATCAGGATCAGCCAAAGAGGAATTTGTCACGATACCAGACGAGGATGAAAATAATGCTGTTGCCCCCAAAATGGAACCCGAATTTGTTAATGAAAGTTTATGGGATGAAGACGACGACGGCACGAATAATGATGAAACGAACTTTGGCGAAGATGACTCGAATATGGAAATGACTGGTTTTGATGGCTCAACGACTGGCGATGGCAATTTAACTGGCGGAGGGGAGGGTGGTGCTGTGGGTGACGCACAAGCGCGTTTTATCCGAGCGGGAAAGGGTCGTCTGTTGTTGCACCGCGGATACACCTTCCGTCTGAAGAACAACCTGGCTCACGGACGAAAACAGTGGTACTGTTCGTCTCGGCTCACATCACGATGCCTGGCCGACGTGAACACCGAAGGTCCTGAGGGTTACGAACGTATCGTTCGTTCGCGTTACGCTCACAATCATGCTCCACCGAACGTTCGTCGTTTCGCCGACGGTCGCTATGTGGTCCGCACACCGCACCAGAGCGGTCATCGCGCCCCGGCTCCTTTAGATCCTCACCAGCAGCTCCTCCAACTTCAGCATGCTCTAGCTCTGTACGCTGCTACCGCTCACGCCACGGCCACGGCTAACGCCTCAGCTGCTGTGTCCGCTGCGACGGGAAGCGGCCAGCAACCATCCCCCAACGCTCTCCCCCCATCCGCACAACACACCCCCGCGGCCAAACCGCTCCTCGTTGACACTCCTTGTAATGAAGAAGATTAA

Protein sequence:

>DPOGS208236-PA
MASDEQFSLCWNNFHANMSAGFHGLLSRGDLVDVTLAAEGRLLQAHKLVLSVCSPYFQEMFKMNPTQHPIVFLKDVSHSALRDLLQFMYQGEVNVKQEELASFISTAEQLQVKGLTGNQNEESSTPSKPKPTSRPGPRSSQQRQSVMTKLETDLDSKPSSTPVAIKRPNRPSIASNNSSSSQSGPAKRKCVDPLEAGPSGSAKEEFVTIPDEDENNAVAPKMEPEFVNESLWDEDDDGTNNDETNFGEDDSNMEMTGFDGSTTGDGNLTGGGEGGAVGDAQARFIRAGKGRLLLHRGYTFRLKNNLAHGRKQWYCSSRLTSRCLADVNTEGPEGYERIVRSRYAHNHAPPNVRRFADGRYVVRTPHQSGHRAPAPLDPHQQLLQLQHALALYAATAHATATANASAAVSAATGSGQQPSPNALPPSAQHTPAAKPLLVDTPCNEED-