Monarch geneset OGS2.0

DPOGS211451
Transcript: DPOGS211451-TA; 1962 bp
Protein: DPOGS211451-PA; 653 aa
Genomic position: DPSCF300223 + 57186-60065
RNAseq coverage: 174x (Rank: top 50%)
Annotation
Heliconius: HMEL007455; 5e-19; 43.75%
Bombyx: BGIBMGA002157-TA; 0.0; 73.20%
Drosophila: pnr-PA; 6e-45; 49.71%
EBI UniRef50: UniRef50_A0NBA0; 2e-52; 36.71%; AGAP002236-PA n=1 Tax=Anopheles gambiae RepID=A0NBA0_ANOGA
NCBI RefSeq: XP_001849613.1; 6e-53; 38.78%; GATA transcription factor GATAc [Culex quinquefasciatus]
NCBI nr blastp: gi|332016552; 3e-59; 33.33%; GATA-binding factor A [Acromyrmex echinatior]
NCBI nr blastx: gi|332016552; 9e-66; 33.43%; GATA-binding factor A [Acromyrmex echinatior]
Group
Gene Ontology:
  GO:0006355; 2.3e-18; regulation of transcription, DNA-dependent
  GO:0008270; 2.3e-18; zinc ion binding
  GO:0003700; 2.3e-18; sequence-specific DNA binding transcription factor activity
  GO:0043565; 8.6e-17; sequence-specific DNA binding
KEGG pathway 
InterPro domain:
  [454-500]; IPR013088; 2.3e-18; Zinc finger, NHR/GATA-type
  [452-503]; IPR000679; 8.6e-17; Zinc finger, GATA-type
Orthology group: MCL34890; Lepidoptera specific
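
The genomic position above can be compared with the transcript length. The following is a minimal sketch, not part of the original record: the helper name `parse_position` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record does not state. Under that assumption, the difference between the genomic span and the 1962 bp transcript would correspond to sequence within the span that is not part of the transcript (for example introns).

```python
import re

def parse_position(text):
    """Split 'SCAFFOLD STRAND START-END' into its four parts.

    Hypothetical helper; coordinates are assumed 1-based and inclusive.
    """
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))

# Values copied from the record for DPOGS211451.
scaffold, strand, start, end = parse_position("DPSCF300223 + 57186-60065")
genomic_span = end - start + 1       # length of the locus on the scaffold
transcript_length = 1962             # bp, from the Transcript field
non_transcript = genomic_span - transcript_length

print(scaffold, strand, genomic_span, non_transcript)
```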
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211451-TA
ATGGAAAATGTATCTTCAATGATTCAAAGAAACGAAGAACAAAGAGGAGAGAACGCAGAGGCCCTACAACTTGTTAAAAATGAGGAACGTTCGGATATGAGCATCAACGACTCAAATACAAATGGCGAAGGATTGTCAACAGTCGTATCAGCGTCAGGAGCGGAAACGGGGGCCGAGCCACACTCAGTGATCACATCGAGGCGTGCACAAAGAACGATAACTACTGCGGGCCATATAACAGATTCAACGGATATAAGCGACGACGGAGTCAAAGACGAACCTCCAGAACAAACGCCGCTGGATCATGACCAGATGCAGTACCCGTCCAAGATGGAGGAAGTCGATCAACGCTCGAGCGCCACACATTACGACGAAGAAAGGTTAAGACTGGCTGAGGCAGAAACGGCACGATATCTCGCTTTTAAACAGGAGCCCTACAGAGGTTTACAGTCTCCATCTCCCGCCCCACAAGATAAACGACCCCCGCAGTACTCACGACTACCTCCTCCGCGGCCTGCTTATGGCCGAGGGCTGTCCCTGCGGTATGGACCTCCTCACCATGTGATAGTATCACAGGACGGAGAGCCCGAGGAACATCCGCATGAAGTTTATCTGCAAAAAGAAAAAGAACAATTACAGCAAGCATACGTATCAAACGAAGCGAACGCGCAGGAAAATAGACAATATGCCGCCGTGCTCGGAGATCAGGTAGCCGTCTCGTCAGCGCTGGAAATGATTCAAACGACATCTTTGAGTAATCAACAGTCGATCGGAGTGGCATATCAACAGGTAAAGTATGAATCGAGAGGTGAAGCCGAGCCCCGGCCGACGACGTACGCCAGCCTACAACCGGTGACGTCGGTACATAGCTCGAGCGGGTACACGTACGCGGGACAGAGTCCTCAGTACGCGGCGGCCGGGTATGGTGCTTACGGTGGCAGCAAAGAGCTGCTGACTCTGTACGGCGCGGGTGGTAGTGCGGGCGGGGCGCCACGGGGGGACGACTCGCCCCCGGGACAGCTGCTCTACCGCAGCGACCCCACGCTCTCCTCCTCGTCTCTGAACACGAGGGCGCACGTCGTCTACGGGTCCGTGGTTCCTCAGTCACAGACGGTCTACGAGACACCGCCCAGTCCAAACTCACAGCAGGTGACACTATACACTCACGGGAATACCGTCCAGTATAAAGTGGGCGGCGAGCACTATCTAGGACAGGGCAGCGGAGTGGAATACGTCCCGGTGTCGGGTTACGAGGGAGGTCTGCTGGTGGAGAGCTACCCTGCCGCGCCTCAACCCTGGCCAGCACACAACATACTTAATATAGATGATGGATTCGATCCCAGCATGGCTGGTATGGGGGAGGTGAAGGAGTGCGTGAACTGTGCGGCGGCCACGACTCCTCTCTGGAGAAGAGACGGCACCGGTCATTACTTGTGCAACGCCTGCGGGCTATACACCAGAATAAATGGCGTCAACCGACCACCACTGAAAGGACAAAAGACGAAGCCACAACAGGCTCTCCCTACTAACGGCAACCGTCGTGTCGGAGTGACCTGCGCCAACTGCCGCACCTCCAACACGACTCTCTGGAGGAGAAACAACAACGGCGAACCCGTCTGTAACGCGTGTGGCCTCTACTACAAGCTGCATAATATAGTTGATATCAATACAACACGCGGTCGAGCATCGATTACAGTAGTAGTAGTGGAGTCATTAGTAGACGACGCGGAGGAAGCCAGGTATCTAATCGGAGCCGTGCAGCAGTTATCAGGGGACTCCTCGGCAAATATTGGGCCTGTCAACCGTCCGCTAAGCATGAAGAAGGACGGCATCCAGACGAGGAAGCGCAAGCCGAAGAGCATGGGCAGCGGCGGGGCCTCGGGCGTAGCGCGCGGTGCTCTGACCGGTACGTTGACTCCCCTAGCCAGCGAGTGGGCCGCCTGCGCCTGGGCTGGCGAATGA

Protein sequence:

>DPOGS211451-PA
MENVSSMIQRNEEQRGENAEALQLVKNEERSDMSINDSNTNGEGLSTVVSASGAETGAEPHSVITSRRAQRTITTAGHITDSTDISDDGVKDEPPEQTPLDHDQMQYPSKMEEVDQRSSATHYDEERLRLAEAETARYLAFKQEPYRGLQSPSPAPQDKRPPQYSRLPPPRPAYGRGLSLRYGPPHHVIVSQDGEPEEHPHEVYLQKEKEQLQQAYVSNEANAQENRQYAAVLGDQVAVSSALEMIQTTSLSNQQSIGVAYQQVKYESRGEAEPRPTTYASLQPVTSVHSSSGYTYAGQSPQYAAAGYGAYGGSKELLTLYGAGGSAGGAPRGDDSPPGQLLYRSDPTLSSSSLNTRAHVVYGSVVPQSQTVYETPPSPNSQQVTLYTHGNTVQYKVGGEHYLGQGSGVEYVPVSGYEGGLLVESYPAAPQPWPAHNILNIDDGFDPSMAGMGEVKECVNCAAATTPLWRRDGTGHYLCNACGLYTRINGVNRPPLKGQKTKPQQALPTNGNRRVGVTCANCRTSNTTLWRRNNNGEPVCNACGLYYKLHNIVDINTTRGRASITVVVVESLVDDAEEARYLIGAVQQLSGDSSANIGPVNRPLSMKKDGIQTRKRKPKSMGSGGASGVARGALTGTLTPLASEWAACAWAGE-
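
The two sequences above can be checked against each other by translating the transcript. This is a minimal sketch, assuming the standard genetic code; the function name `translate` is hypothetical, and only the first and last few codons of DPOGS211451-TA are used here to keep the example short. Note that this sketch writes the stop codon as `*`, whereas the record writes it as `-`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(nucleotides):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))

# First 13 codons and last 3 codons of DPOGS211451-TA, copied from the record.
prefix = translate("ATGGAAAATGTATCTTCAATGATTCAAAGAAACGAAGAA")
suffix = translate("GGCGAATGA")

# 1962 bp is 654 codons; excluding the stop codon leaves 653 residues.
expected_protein_length = 1962 // 3 - 1

print(prefix, suffix, expected_protein_length)
```

Passing the full nucleotide sequence to `translate` and comparing the result (with `*` replaced by `-`) against DPOGS211451-PA extends the same check to the whole record.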