Monarch geneset OGS2.0

DPOGS214135
TranscriptDPOGS214135-TA1167 bp
ProteinDPOGS214135-PA388 aa
Genomic positionDPSCF300014 - 1302733-1305330
RNAseq coverage21x (Rank: top 79%)
Annotation
HeliconiusHMEL0056714e-17278.37% 
BombyxBGIBMGA006182-TA7e-16973.42% 
Drosophilagcm-PA5e-6268.21% 
EBI UniRef50UniRef50_D6WJ636e-8451.45%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WJ63_TRICA
NCBI RefSeqXP_975103.22e-8452.91%PREDICTED: similar to AGAP007783-PA [Tribolium castaneum]
NCBI nr blastpgi|2700079822e-8351.45%hypothetical protein TcasGA2_TC014730 [Tribolium castaneum]
NCBI nr blastxgi|1892378674e-8653.29%PREDICTED: similar to AGAP007783-PA [Tribolium castaneum]
Group
Gene OntologyGO:00036778e-82DNA binding
GO:00063558e-82regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[1-119] IPR0039028e-82Transcription regulator, GCM-like
Orthology groupMCL15671 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214135-TA
ATGAGGAACACCAACAACCACAACGTCCACATCCTCAAGAAGAGCTGCCTCGGAGTGCTCGTGTGCTCGTCGAGATGTCGCCTGCCCGACGGCAGTAGAGTACACTTACGACCGGCCATCTGTGATAAGGCGAGGAAGAAACAACAAGGCAAGCCCTGCCCCAACCGCGTGTGCAACGGGGGCCGGCTCGAGGTCCAACCGTGTCGAGGGCACTGCGGCTACCCGGTCACACACTTCTGGCGCCACACGGAACACGCGATATTCTTCCAAGCCAAGGGATCCCACGACCATCCTCGACCCGAGGCCAAGGGAGCCAGCGAGGTGCGGAGGTCGCTCGGCGCCGGCAGACGGGTCCGCGGGCTTGCCCTACTACTGGCGAGGGAGGCCGCCATCGCTGACAAAATACTGACCGTCAAACAAGACAAGCAAATGACACACAAGAATAACATCCACGCACCGCCCCCGCCGCTCATACCAGACAACCAACGATCCTTAACCTGCACGTGCGGACCGTTCGAATGCAACTGTCGGTGGCGCTCCGAGTCGGCGGCGGAGCCCTACGCGGCGGCGGCGTGGACCCCCAGCGAGCACCAGTCCTACACCGCTTACATTCCCCCCATACAGCCCACACCGGCGGCTGCGTCACACACGTACGACCCCACCTCCTTGCCGGCGGACGACATCTTCCATCCAGAGGAGATATTCCAATTGGATCAACCGATCCGCCTAGACTTCCCCCTCGAGGACAATACCTTGGAGTCTCCGCCGACGTTCGCGGATCTCAACAGCGACACCTCCAGAGCCGACGACGCCTATTGGTTGGAGTGGCAGAGACCCGCGGTCGGCTCGGACGCCAGCGACACGCCGTCCCCCGAGCTCTTCAACAACTACCAACAGACGGAGCCCTACTGCGAGCAGCCCTACATACACCAGAACTACTACCCGGAGGAGGCGCAGTACTACCCGATGGAGCACATGAGAGAGTCGCCGGGGATGGAGGGTCAGGGGCAGAGGTACTTCCGGTACGGTCAGGACTGCGAGCCCAACAACATGGACGTCCACACGTGGAACTACTCCGACTGCGCGTTCAGCAACCAGCCGGCGGAGTCCAAGGAGTACTACAACGTGCACTCGCTGAGCGTGAACACCTTCAACCCGCTGTTATAA

Protein sequence:

>DPOGS214135-PA
MRNTNNHNVHILKKSCLGVLVCSSRCRLPDGSRVHLRPAICDKARKKQQGKPCPNRVCNGGRLEVQPCRGHCGYPVTHFWRHTEHAIFFQAKGSHDHPRPEAKGASEVRRSLGAGRRVRGLALLLAREAAIADKILTVKQDKQMTHKNNIHAPPPPLIPDNQRSLTCTCGPFECNCRWRSESAAEPYAAAAWTPSEHQSYTAYIPPIQPTPAAASHTYDPTSLPADDIFHPEEIFQLDQPIRLDFPLEDNTLESPPTFADLNSDTSRADDAYWLEWQRPAVGSDASDTPSPELFNNYQQTEPYCEQPYIHQNYYPEEAQYYPMEHMRESPGMEGQGQRYFRYGQDCEPNNMDVHTWNYSDCAFSNQPAESKEYYNVHSLSVNTFNPLL-