Monarch geneset OGS2.0

DPOGS213467
TranscriptDPOGS213467-TA1206 bp
ProteinDPOGS213467-PA401 aa
Genomic positionDPSCF300100 - 316940-325551
RNAseq coverage76x (Rank: top 65%)
Annotation
HeliconiusHMEL0168342e-16382.85% 
BombyxBGIBMGA004491-TA1e-4696.67% 
DrosophilaAP-2-PB1e-11158.60% 
EBI UniRef50UniRef50_Q174D71e-12464.71%Transcription factor ap-2 n=16 Tax=Eumetazoa RepID=Q174D7_AEDAE
NCBI RefSeqXP_001652342.12e-12564.71%transcription factor ap-2 [Aedes aegypti]
NCBI nr blastpgi|1571146214e-12464.71%transcription factor ap-2 [Aedes aegypti]
NCBI nr blastxgi|1700561785e-13061.81%transcription factor ap-2 [Culex quinquefasciatus]
Group
Gene OntologyGO:00056343.2e-163nucleus
GO:00063553.2e-163regulation of transcription, DNA-dependent
GO:00037003.2e-163sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[38-395] IPR0049793.2e-163Transcription factor AP-2
[176-380] IPR0138541.9e-94Transcription factor AP-2, C-terminal
Orthology groupMCL11633 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213467-TA
ATGGAACTCAATCATCAGAACCTTCACATTTACAAGAGCATCCATGAGCGTCTGGGCGGCGGCGGGCTGGGTTTGGGCGGCGGAGGCTTCCGCGGCGCTCAACCCTCGCTCGCCGACTTCCAGCCGCCATACTTCCCGCCGCCCTTCGCCCCTAGCGCACATCCTGCGAGTCCGCACCACCAACAACAGAGCCATGGCATGGAGTATTCTGGAGGTCCGGAGTACGGGCAGCACTACGCGCCGCAGCAGCTCCTACCACGACACCACGGACACGAGCCTCCCCATCTCAGACATCATAGAGACCATCACGACGTACACTCCCACCACCTACCTCACGGTGGATTCAGTTACGACAGGAGGACGGACTACGGAGCCCGGGAGCAACACGACCTCGCCCTTCATCACGCGTTACACACGGACGAGACACAGAATGCAGGCATGGACGATACGACGGGCTTCATGACCGACCTTCCTTTATTAAAAACAATGAAAGCCCGCGATGTAGGGACAGGTGCCTGCGCCCCCAGCGACGTGTTCTGCTCTGTACCAGGGAGACTCTCTCTCCTGTCTTCGACCAGCAAGTACAAGGTCACCGTCGCCGAGGTCCAGCGAAGGCTCTCACCACCAGAGTGCCTGAACGCGTCACTACTCGGAGGTGTACTGAGAAGAGCAAAAAGCAAAAATGGCGGTAGGTTACTTAGGGAAAAACTAGAGAAAATCGGTCTGAACCTTCCAGCGGGGCGACGGAAAGCGGCTAACGTGACGCTACTCACGTCATTAGTAGAAGCCGAGGCTGTTCATTTGGCGCGTGATTTTGGTTACGTCTGCGAGACTGAGTTCCCGGCCCGAGCGCTCGCGGAATACCTCGCGAGACAATACGCTGAACACGACGCCAGACGACGCAGGGACCTGTTACACGCCACCAAACAGGTGGTGAAGGAGGTGATGGACCTATTGAACCAGGACCGTTCTCCTCTGTGTAACACGCGACCCCCTCACCTCTTGGAGCCGGCCATACAGCGGCACCTCACACACTTCTCTCTCATATCACACGGTTTCGGTGGACCGGCCATCGTCGCCGCACTGACAGCCATACAGAATTTCCTAAACGAGTCGTTAAAGCATTTAGACAAGTTATATCCACAGAGCGGGATGGTGTCGTCGACAATGGACAAGACAAAAATGGATCCCGACATCAAAAAGTAG

Protein sequence:

>DPOGS213467-PA
MELNHQNLHIYKSIHERLGGGGLGLGGGGFRGAQPSLADFQPPYFPPPFAPSAHPASPHHQQQSHGMEYSGGPEYGQHYAPQQLLPRHHGHEPPHLRHHRDHHDVHSHHLPHGGFSYDRRTDYGAREQHDLALHHALHTDETQNAGMDDTTGFMTDLPLLKTMKARDVGTGACAPSDVFCSVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRLLREKLEKIGLNLPAGRRKAANVTLLTSLVEAEAVHLARDFGYVCETEFPARALAEYLARQYAEHDARRRRDLLHATKQVVKEVMDLLNQDRSPLCNTRPPHLLEPAIQRHLTHFSLISHGFGGPAIVAALTAIQNFLNESLKHLDKLYPQSGMVSSTMDKTKMDPDIKK-