Monarch geneset OGS2.0

DPOGS207344
TranscriptDPOGS207344-TA2193 bp
ProteinDPOGS207344-PA730 aa
Genomic positionDPSCF300188 + 180524-213844
RNAseq coverage954x (Rank: top 13%)
Annotation
HeliconiusHMEL0022064e-9287.80% 
BombyxBGIBMGA010273-TA3e-13772.25% 
Drosophilaosa-PB2e-8848.61% 
EBI UniRef50UniRef50_E2AVY32e-9454.18%Trithorax group protein osa n=8 Tax=Coelomata RepID=E2AVY3_CAMFO
NCBI RefSeqXP_002056764.15e-9353.15%GJ24712 [Drosophila virilis]
NCBI nr blastpgi|3072101694e-9453.99%Trithorax group protein osa [Harpegnathos saltator]
NCBI nr blastxgi|2420057242e-12941.72%trithorax group protein osa, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00036771e-38DNA binding
GO:00056221e-38intracellular
KEGG pathway 
InterPro domain[320-411] IPR0016061e-38ARID/BRIGHT DNA-binding domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207344-TA
ATGGCACAAGATGGCATGTTGAACGGAGACGTTGACGAGATATGTCGAGGCTACCCTCCACCACCGAGACCAGCTCAGCCATCAACACCCAATGCACACGATCAGGATGCGGATCTGACCGGTCAGAACAGCAATGACTCCGGCGGCAGCGGTGGCGCGGGGCGCGCTTCCACGCCGCACTTGAGGCCCACACCGAGCCCCACCGGCTCCAGTGGATCGCGATCCATGTCACCGGCTGTCGGTACCCAGAATCTACCGATGCCACCGCGGCCGTCGTCGTCTCTATCGGACGGCAGCGGGCCGACCGTGCGAGCAGGCACCGGCGCCCCCGCTGCTGGCCCTCCGCCCTCGGGCGCTCCGCCCCCCGGGGCCATGCTGCCGCAGCCTTACCCGCATCACGGCCCGTACAAGACGGCGCCCTACCCGCCGCAGCCCTACGGCTACCCCCCGCGCAACCACCACCCCTATCCTTACGGAGGATACAGGCCGACACCTCCGCCACATCCCTCACAACATTACCCGCCACTCAAGCAGGCGGGTCGTCACATGGGTCCGCCGGCGGGAGCTGGCGAGGCGATGCCGCCGCCGACCGCACCCGGGGAGCCCCATGACAACGGCCCCGCGGCGCCCGCCACCGCCCTCGTCACCACCGGCCCCGACGGAGCGCCGCTGGACGAGGGCAGCCAGCAGAGCACGCTCAGCAACGCCTCAGCTGCTTCCGGCGAGGAGACGTGCGGTACACCCAAGAGTCGCAAGGAGTACGGCGCGGGCAGTGCGGCGCCCTCGCCCTCGCCCGGCGGGGCCTCGCACTCGTCCGTACACGACGAATACGACGCCTCCGCCTCGCCCTGGCCGAGACCGCCATCTAGTCCCGTATTTAACAGTCACATAGCGCCGGAGTCCTACAGATCAAAGAAGTCGGACTCGCTGGGCAAGCTGTACGAGATGGACGACGCGCCGGAGAGGAGGGGCTGGGTGGAGAGACTGCTGGCCTTCATGGACGAGAGGCGCACGCCCATCGCCGCCTGCCCCACCATCAGCAAGCAGCCGCTCGACCTCTACAGGCTGTACCTGCTGGTGCGGGACCGCGGGGGATTCGTCGAGGTCACTAAAAATAAAACGTGGAAAGACATAGCCGGTTTACTCGGCATCGGCGCGTCGTCGTCGGCCGCTTACACCCTGAGGAAGCATTACACGAAGAACCTGTTGGCGTACGAGTGTCACTTCGACCGCGGCGGCATCGACCCCCAGCCCATCATCAGCCAGGTGGAGGCGTCCACGAAGAAGAAGAGCGGGAAGGCCAACAGCACCTCCAGCGCAGGGTCGTCAAACTCCCAGGAGCAGTTCCCGGGCGGCGCGGCGGACGGCTATCCGTCACACGGCGCGCACCCCGCACACTACGCACCCTACCCCCCGCAGCCGAGCCAACCGCAGGGCGGCGGGCCGGGCGGCGACAACCTCGCCGCCTCCAACCCATTCGACGAGCCGCCGGGGCCCAGGCGACCCCCAGGTTACCAACAAGGTTACGGGTACGAGTACGGCTCGCCCTACCCCACCAACAGGCCGGTGTATCCGCCCTACGGTCCGGAAGGAGACAGGGGTTACGGCGGTAGCGGGGAATACCGCTACGGGTATGGCGGCTATCGTGCGGGGGCGCCCGCGGCCGGAGCTCCGCCCTCACAGCCCGCGCCCGCGCAGCCCGCACCGGCTCAGCCGGCGCAGCCCTACCCGGACTACTACCGCGCGCCTCACCCTCACGCGCACCCTCACCCGCCGCACCCGCCGCACCCCCCGCACCCTCCACACCCGCCGCACTCGCCGCAGCAGCAGCACGACGTGAGTACTACCCTAGTACACACACGGCACGCCGCGTCCCCGACACCGGGCCGCGTGTCGGGTCGTGTCGTGTCGTGTCGTGTGGTGTCGTACCGTCCGTGTGCCTCCTACCAGCTGTCGTCGGGCGCGGGTCTGCTGAGCTCGCAGCTCGCTCGTCAGCTCGTGGCGCCGCTGCCGTCCCCGCGGAGCGCCTACGTACGGACGTTTGTACCACGCACTCAACACCGCTTCCCATATTGGCGACCGGTCCATTGCACCGACCCTCCACACACACACACACACGAACAGTCACACACACGCGCGGGTCATGATTGGGCGCACCGAACAATCCCCCCCACCCCGCCATCCTCCCTCTAG

Protein sequence:

>DPOGS207344-PA
MAQDGMLNGDVDEICRGYPPPPRPAQPSTPNAHDQDADLTGQNSNDSGGSGGAGRASTPHLRPTPSPTGSSGSRSMSPAVGTQNLPMPPRPSSSLSDGSGPTVRAGTGAPAAGPPPSGAPPPGAMLPQPYPHHGPYKTAPYPPQPYGYPPRNHHPYPYGGYRPTPPPHPSQHYPPLKQAGRHMGPPAGAGEAMPPPTAPGEPHDNGPAAPATALVTTGPDGAPLDEGSQQSTLSNASAASGEETCGTPKSRKEYGAGSAAPSPSPGGASHSSVHDEYDASASPWPRPPSSPVFNSHIAPESYRSKKSDSLGKLYEMDDAPERRGWVERLLAFMDERRTPIAACPTISKQPLDLYRLYLLVRDRGGFVEVTKNKTWKDIAGLLGIGASSSAAYTLRKHYTKNLLAYECHFDRGGIDPQPIISQVEASTKKKSGKANSTSSAGSSNSQEQFPGGAADGYPSHGAHPAHYAPYPPQPSQPQGGGPGGDNLAASNPFDEPPGPRRPPGYQQGYGYEYGSPYPTNRPVYPPYGPEGDRGYGGSGEYRYGYGGYRAGAPAAGAPPSQPAPAQPAPAQPAQPYPDYYRAPHPHAHPHPPHPPHPPHPPHPPHSPQQQHDVSTTLVHTRHAASPTPGRVSGRVVSCRVVSYRPCASYQLSSGAGLLSSQLARQLVAPLPSPRSAYVRTFVPRTQHRFPYWRPVHCTDPPHTHTHEQSHTRAGHDWAHRTIPPTPPSSL-