Monarch geneset OGS2.0

DPOGS211311
Transcript: DPOGS211311-TA (1335 bp)
Protein: DPOGS211311-PA (444 aa)
Genomic position: DPSCF300125 - 67877-71808
RNAseq coverage: 308x (Rank: top 37%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL009363        6e-104   82.87%
Bombyx          BGIBMGA004964-TA  3e-168   90.00%
Drosophila      Ada2b-PB          7e-107   54.74%
EBI UniRef50    UniRef50_D6WTS8   3e-109   46.07%    Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WTS8_TRICA
NCBI RefSeq     XP_312792.4       8e-114   49.16%    AGAP003109-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|380011537      1e-112   50.00%    PREDICTED: transcriptional adapter 2B-like [Apis florea]
NCBI nr blastx  gi|380011537      1e-121   50.51%    PREDICTED: transcriptional adapter 2B-like [Apis florea]
Group
Gene Ontology:
  GO:0005515  1.6e-11  protein binding
  GO:0008270  7.1e-11  zinc ion binding
  GO:0003677  3.8e-08  DNA binding
  GO:0006355  3.8e-08  regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain:
  [63-119]  IPR009057  1.6e-11  Homeodomain-like
  [9-49]    IPR000433  7.1e-11  Zinc finger, ZZ-type
  [68-105]  IPR014778  2.6e-08  Myb, DNA-binding
  [68-124]  IPR012287  3.8e-08  Homeodomain-related
  [67-116]  IPR001005  7.4e-06  SANT domain, DNA binding
Orthology group: MCL13509 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211311-TA
ATGTCGTTTTCAGATCTTTATGCTAAATATAATTGTACATATTGTCAAGAAGAAATCAATGGAGTACGTGTAAGATGTGCAGAATGTAAAGATTTTGATATTTGTTTGCAGTGTTTTTCTCTTGGAGCTGAAATTGGACCACACAAAAATGACCATTCCTATCAATTTATGGACTCTGGAGCTTTTGGAATATTTTTAGGTAGAACTAGTTGGTCTGCTAATGAAGAAGTAAGATTGCTAGATGCTATAGAACAGTTTGGATTTGGAAATTGGGAAGATATTGCTAAGCATATAGAAACCAAAACACCAGAAGAAGCCAAAGATGAGTATATTACTAGGTATTTGGAGGGTAGTATAGGTCGGGCTACATGGGGTAATGTGGAGAGCACTAGTCGGCCATCACTCCACTGCGCTGATAGAGATGAAGGTCCACTGAGTCCTAGTGCAGTATCAAGACTTCCACCACTAGCTATAACTGCTGATGAAGCGGCCCAGCTCGGTTATATGTCAAACAGAGATGACTTTGAAAGGGAGCATGATCATGAAGCAGAGCAATTAATATCAACATTGTCTCTTAACCCCGAGGATGACAATTTGGATGTTGCGTTGAAGTTGTCGCAAGTAGATATTTACACTCGAAGGTTGAGGGAAAGGACGAGACGGAAAAGGCTGGTACGGGATTATCAACTGGTGTCAGTATTTTTCAACAATCAGAGAAATAAACAGAAAACCCTTGGAAAACTTGCCAAAGAAAAAAAGGAGTTTACTGATCGTCTTAGATGGACGGCCCAGTTCTACGGTCGTTCGGAGCAGGCTGCCGTGGTAGCGGGTCTGTGGAGGGAACGAGAATTGAGGGTCCGCCTGGCTGAGCTTCATCGATACAGACTTGCCGGCGTTACCCGACTCGAGGAATGCGCCCACTACGAACAACACGCTGCGCATAGGAAACATCCGCATCACATCGACGTGAGACGCGTCATGGGCAGCAGTGGGTGCCTGGACGCGCAACAGACAAAAGAATCAACACAGACCAACACTCCGCAGCAGCTAAGAAAAAGAGACGTAGAAAGCGGCTCAAGTTCCACCAGCCCAAAGTGCACACGGGAAGGAAGTACCGCATGTGGATGTTGCAGAAAGAGCTCATGCAGCGCAGGATGCTCGACACATCTGCTGACCACTAATGAAATACAGTTATGTACAGCCCTCAATCTGCCTGCCACTCAGTATGTAACACTAAAGGGAGTGTTACTTCGTAAGCCAGCTCAGTCCCCTGACGCTGATGTGGATAGAGCAGTGAGGAAATATTTGTCAAATGCTGGGTGGCTTCACCATTAA

Protein sequence:

>DPOGS211311-PA
MSFSDLYAKYNCTYCQEEINGVRVRCAECKDFDICLQCFSLGAEIGPHKNDHSYQFMDSGAFGIFLGRTSWSANEEVRLLDAIEQFGFGNWEDIAKHIETKTPEEAKDEYITRYLEGSIGRATWGNVESTSRPSLHCADRDEGPLSPSAVSRLPPLAITADEAAQLGYMSNRDDFEREHDHEAEQLISTLSLNPEDDNLDVALKLSQVDIYTRRLRERTRRKRLVRDYQLVSVFFNNQRNKQKTLGKLAKEKKEFTDRLRWTAQFYGRSEQAAVVAGLWRERELRVRLAELHRYRLAGVTRLEECAHYEQHAAHRKHPHHIDVRRVMGSSGCLDAQQTKESTQTNTPQQLRKRDVESGSSSTSPKCTREGSTACGCCRKSSCSAGCSTHLLTTNEIQLCTALNLPATQYVTLKGVLLRKPAQSPDADVDRAVRKYLSNAGWLHH-
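
Consistency check (sketch):

The transcript and protein records above can be cross-checked by translating the coding sequence and comparing the result with the protein sequence. The following is a minimal sketch, assuming the standard genetic code applies and that the trailing `-` in the protein record marks the stop codon. The function name `translate` and the `stop_symbol` parameter are illustrative and are not part of the database. Only the first and last few codons of DPOGS211311-TA are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to match the
    trailing '-' in the protein record). Raises ValueError if the length
    is not a multiple of three or a codon is not in the table.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unrecognized codon: {codon}")
        aa = CODON_TABLE[codon]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last five codons of DPOGS211311-TA.
head = translate("ATGTCGTTTTCAGATCTT")
tail = translate("TGGCTTCACCATTAA")

# Length bookkeeping: 444 residues plus one stop codon, three bases each.
expected_transcript_length = 3 * (444 + 1)
```

Passing the full nucleotide sequence to `translate` in the same way should give a string that can be compared directly with the DPOGS211311-PA record.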