Monarch geneset OGS2.0

DPOGS202186
TranscriptDPOGS202186-TA1254 bp
ProteinDPOGS202186-PA417 aa
Genomic positionDPSCF300149 - 608139-609392
RNAseq coverage211x (Rank: top 46%)
Annotation
HeliconiusHMEL0091720.068.45% 
BombyxBGIBMGA013484-TA2e-15261.67% 
DrosophilaRpb4-PB3e-7035.33% 
EBI UniRef50UniRef50_D6WIZ33e-7839.52%Putative uncharacterized protein (Fragment) n=1 Tax=Tribolium castaneum RepID=D6WIZ3_TRICA
NCBI RefSeqXP_391932.31e-7840.29%PREDICTED: similar to transcriptional adaptor 2 (ADA2 homolog, yeast)-like [Apis mellifera]
NCBI nr blastpgi|3407226072e-7941.19%PREDICTED: transcriptional adapter 2-alpha-like [Bombus terrestris]
NCBI nr blastxgi|3071795862e-8238.46%Transcriptional adapter 2-alpha [Camponotus floridanus]
Group
Gene OntologyGO:00055155.4e-23protein binding
GO:00036776.2e-09DNA binding
GO:00063556.2e-09regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[317-417] IPR0090575.4e-23Homeodomain-like
[344-414] IPR0075262.1e-11SWIRM
[64-109] IPR0122876.2e-09Homeodomain-related
[62-111] IPR0010056.8e-09SANT domain, DNA binding
[65-108] IPR0147781e-08Myb, DNA-binding
Orthology groupMCL15559 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202186-TA
ATGGCAAATGATTTATTGCAAGTAAAGTGTGATATTTGCGACGAGATCGCTCACGAGCCCTATATAGAGTGTTGTGAATGCGACACTGTATTGTGTTGTTCCTGTTTCGCGTCGGGAAAAGAGAAAGATAATCACAGAAACGATCACAAGTACGCTATAAGAAAAAATGACTTTCCACTATTTGAAAACTGCAACTGGTCAGCTAAAGAGGAATGTAAGCTATTGAATGCACTATCTAATTATGGTTATGGAAATTGGGAAGAAATAGCTAAAAGTGTGCATACGAGATCGAAACTGGAATGCCAAGAGCATTATAAAAAGTATTACATAGAAAACGTGAAGTATGATGAGCTGAAATTATTACCGGAAACTAAAGAGTCATTATATCAACCACCTCTAACCCCATACCTGTATAACACAGATCTTAGTATAAACCCACCAAGAAATAACCAATCCGACCCACTTCTCGCCGGTTACAATGCTCATAGATCTGACTTTGAACTCAGCTATGACCATAACGCCGAAAACATATTCAGCACCGATATAAGCTATTCCGCTGATGATGAAGAGGACGATGAATGTATGGATTCGCTGAAGGTTAGTTTGGTCAGTGCACTAAACACTAGATTAAGAGAAAGGCAACGGCGTTACAACATCATCCAGGAACATGGACTCATCATGACCAATAAGTTATTGTCCTGGTTGAAGAGGTTCGATAGTACTCTGTCCAGATCCAAAGCAGAAAAACTACTGTCATTCATGCAGTTCATGAGTGGAATGCAGTTTGATAGCTTAATGGAGTCCCTTAGTTTAGAAGAGGAAATTCTCAATAGAATCGTAAGGCTGTGTGATTATCGGAGGAATGGGATACAAAACGACAAAGTCTATAAAGAACAGAAATATGTCACCAATATGATGATTAAGAAATTTGACAGTCAGTCACAGATGAAGAGTAAGAACAGTTTGTTTGGTAACAGTATCGGCAGCAAGAAGATAAAAAGAACACTCATGCCGCTGGACATATTGGACATGCCAGGATACCACCTGCTGTCGGACAGTGAGCGAGACTTGTGCTCCAATGTCAGAGTGATCCCGGAGAATTTTCTCGACATCAAAAGAGTCCTCATAGCGGAGAACAACAAACTGGGTTTCCTACGCTTACTGGATGCACGGCGAGTCGTCAAGATAGATGTGAATAAGACTAGGAAAATATATGATCACCTGTTGTCCGAGGGATTCATTGTAAAACCTTAG

Protein sequence:

>DPOGS202186-PA
MANDLLQVKCDICDEIAHEPYIECCECDTVLCCSCFASGKEKDNHRNDHKYAIRKNDFPLFENCNWSAKEECKLLNALSNYGYGNWEEIAKSVHTRSKLECQEHYKKYYIENVKYDELKLLPETKESLYQPPLTPYLYNTDLSINPPRNNQSDPLLAGYNAHRSDFELSYDHNAENIFSTDISYSADDEEDDECMDSLKVSLVSALNTRLRERQRRYNIIQEHGLIMTNKLLSWLKRFDSTLSRSKAEKLLSFMQFMSGMQFDSLMESLSLEEEILNRIVRLCDYRRNGIQNDKVYKEQKYVTNMMIKKFDSQSQMKSKNSLFGNSIGSKKIKRTLMPLDILDMPGYHLLSDSERDLCSNVRVIPENFLDIKRVLIAENNKLGFLRLLDARRVVKIDVNKTRKIYDHLLSEGFIVKP-