Monarch geneset OGS2.0

DPOGS214562
TranscriptDPOGS214562-TA1137 bp
ProteinDPOGS214562-PA378 aa
Genomic positionDPSCF300266 + 155307-158795
RNAseq coverage855x (Rank: top 15%)
Annotation
HeliconiusHMEL0161150.084.72% 
BombyxBGIBMGA003222-TA0.086.91% 
Drosophilamri-PA3e-13368.04% 
EBI UniRef50UniRef50_E2A4763e-14182.82%BTB/POZ domain-containing protein 10 n=5 Tax=Endopterygota RepID=E2A476_CAMFO
NCBI RefSeqXP_395499.23e-14371.39%PREDICTED: similar to mrityu CG1216-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3320242612e-14270.05%BTB/POZ domain-containing protein 10 [Acromyrmex echinatior]
NCBI nr blastxgi|2700046007e-14166.33%hypothetical protein TcasGA2_TC003964 [Tribolium castaneum]
Group
Gene OntologyGO:00055159.2e-08protein binding
KEGG pathwaydme:Dmel_CG12162e-131 
 K00864 (E2.7.1.30, glpK)maps-> Plant-pathogen interaction
    Glycerolipid metabolism
    PPAR signaling pathway
InterPro domain[88-186] IPR0113331.6e-17BTB/POZ fold
[88-191] IPR0002109.2e-08BTB/POZ-like
Orthology groupMCL14433 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214562-TA
ATGTCTGACTCTCAAGCCAACGGCCAGGGGGCCATGTCAGAGCCACGCAGATCCTTTTTCTACCCAGACAGCAGCAGTGATACAGAAGAGTACAGAACAGACGCTGAAGATCGTCGTAAGCGTCTGTCCAAACGTAACGGACCGAATATTAGAAGGATGCCGCCAAACATGCCACCAAAAAACCAGAATCCCAACGCTCCTATACCTTCCACATCAGGCCAGGATTTTCCGAAAAACGGTCAGAAAAATCAGTTATGCGAAGACAGGATCACGTTGGTTGTTGATAATACGAGATTCGTCGTCGACCCCGCACAGTTCACAGCGCATCCGAATACAATGCTCGGACTTATGTTTAGTTCAAGTAAAGAGCTAACACATCCGAACGAGCGGGGTGAATATGAAGTAGCCGAGGGCATATCAGCGACCGTGTTCAGAGCTATCCTGGAGTACTATCGTGGGGGAACTATCAGGTGTCCGCCGACAGTATCGGTTCAGGAGTTGAGGGAAGCCTGCGACTACTTACTTGTGCCGTTCGATGCTAATACTGTCCGATGTCAAAACCTTCGCGGTCTGTTACACGAGCTGTCAAACGAGGGCGCCCGCCGCCAGTTCGAGAGTTTCCTGGAGCGTCTCATCCTGCCGCTGATGGTTGAGTCAGCTGAGCGGGGTGACCGCGAGTGTCACGTGGTCGTGCTCTTGGACGACGACAGTGTAGACTGGGACGAACAGTATCCACCACAGATGGGAGACGAGTACAGCCAGACTGTCCTGTCCACGCCCTTATACCGGTTCTTCAAGTATATTGAGAATAGGGATGTCGCCAAACAAGTGATGAAGGAGCGCGGCTTAAAGAAGATTCGGCTCGGCGTGGAAGGTTATCCGACTTACAAGGAGAAAGTAAGAAAACGTCCCGGAGGAAGGGCAGAGGTCATATACAACTATGTCCAGCGACCCTTCATACACATGTCCTGGGAGAAAGAGGAGGCCAAGAGCCGGCATGTGGATTTTCAGTGCTTCAAATCGAAATCTGTTACCAACTTGGCAGAAGCCACCGCTGACCCTGTGATAGAGTTAGAGAATAGAGAAAGAGAGGTCGAAGTCCAGGAACCGGGGGAAGTGGTTGAGGAGGAACAGTGA

Protein sequence:

>DPOGS214562-PA
MSDSQANGQGAMSEPRRSFFYPDSSSDTEEYRTDAEDRRKRLSKRNGPNIRRMPPNMPPKNQNPNAPIPSTSGQDFPKNGQKNQLCEDRITLVVDNTRFVVDPAQFTAHPNTMLGLMFSSSKELTHPNERGEYEVAEGISATVFRAILEYYRGGTIRCPPTVSVQELREACDYLLVPFDANTVRCQNLRGLLHELSNEGARRQFESFLERLILPLMVESAERGDRECHVVVLLDDDSVDWDEQYPPQMGDEYSQTVLSTPLYRFFKYIENRDVAKQVMKERGLKKIRLGVEGYPTYKEKVRKRPGGRAEVIYNYVQRPFIHMSWEKEEAKSRHVDFQCFKSKSVTNLAEATADPVIELENREREVEVQEPGEVVEEEQ-