Monarch geneset OGS2.0

DPOGS203270
Transcript: DPOGS203270-TA (915 bp)
Protein: DPOGS203270-PA (304 aa)
Genomic position: DPSCF300229 + 111119-120339
RNAseq coverage: 65x (Rank: top 67%)
Annotation
Heliconius: HMEL007452  3e-13  39.56%
Bombyx: BGIBMGA000449-TA  3e-93  93.41%
Drosophila: grn-PB  2e-29  93.33%
EBI UniRef50: UniRef50_D7EHV3  2e-51  54.10%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EHV3_TRICA
NCBI RefSeq: NP_001158260.1  2e-52  54.10%  grain [Tribolium castaneum]
NCBI nr blastp: gi|258645094  3e-51  54.10%  grain [Tribolium castaneum]
NCBI nr blastx: gi|270015717  8e-54  54.10%  hypothetical protein TcasGA2_TC002315 [Tribolium castaneum]
Group:
Gene Ontology:
  GO:0006355  1.2e-22  regulation of transcription, DNA-dependent
  GO:0008270  1.2e-22  zinc ion binding
  GO:0003700  1.2e-22  sequence-specific DNA binding transcription factor activity
  GO:0043565  2.2e-21  sequence-specific DNA binding
KEGG pathway:
InterPro domain:
  [197-248]  IPR013088  1.2e-22  Zinc finger, NHR/GATA-type
  [197-247]  IPR000679  2.2e-21  Zinc finger, GATA-type
Orthology group: MCL20688  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203270-TA
ATGACTTTTGTAATTTTGGAAAAGCCCGGCCTTTTGTTCTCGGACGCAGCCAGCGAGACTGGTCGAACGCTAAGCGGGAGTAGCAGTCAGGTCTGCAGGCCGCACTTCCACACGCCGTTGCATCCGTGGCTTGATTCCAAAGCCCTCGGCAGCGGTGCTTGGGGCGGGGGCTTCCAACAGGATGCGCCCGCCGATAAGCCCCTGTCCCCTGCACAGCATCCTCATCCTCATCCCCTCTTTTCCTTCCCTCCCACACCCCCAAAAGACTCCACCCCAGATTCCGTGGCAGCTGCGGCGTTAGGGCAACAAGATTATCAAGCAGCTGTAGCGCACGCCACGGCGATGAGCGTCGCGTTCATGCAGCAGCAACAGGAGCTTCCTCTATGCGGAGACGTGAAGCCCATGATGGGCGCGCCGCCTTCGAAGCAACGCGAAGGTGCGCAGCCGGACTCGTGTCCCCAGGAGAGTCAGCCCTCTGGGGAATACAACGCTCAGTACAACGCCGCCGCCGGCGACTACTACAACTATCAAGGGAAGGGTTATAGCGGGCAGCAACAGAGTAAGCCCCGGCCTAACAAGACTCGAACTAGCGCCGAGGGTCGTGAGTGCGTCAACTGCGGTGCGACCAGTACTCCTCTGTGGAGAAGAGACGGGACTGGACACTACTTGTGCAACGCCTGTGGACTCTACTACAAGATGAACGGACAAAATCGACCCCTCATCAAGCCCAAGCGAAGACTGTGTCATTTTAACAACATAAAACGAAACCCGAAATGGGTGTTGTTTACAGTTAGGGGCAGCAATAAAAAGGCGGGGCATGAATTCGAGCACGACAGCACATTAAGTGCTCCCACAAATTGGGCTCTTAAGAAACCTGAGCGTTTCGACGAAAGAGGACACACCGTCCAGAACTAG

Protein sequence:

>DPOGS203270-PA
MTFVILEKPGLLFSDAASETGRTLSGSSSQVCRPHFHTPLHPWLDSKALGSGAWGGGFQQDAPADKPLSPAQHPHPHPLFSFPPTPPKDSTPDSVAAAALGQQDYQAAVAHATAMSVAFMQQQQELPLCGDVKPMMGAPPSKQREGAQPDSCPQESQPSGEYNAQYNAAAGDYYNYQGKGYSGQQQSKPRPNKTRTSAEGRECVNCGATSTPLWRRDGTGHYLCNACGLYYKMNGQNRPLIKPKRRLCHFNNIKRNPKWVLFTVRGSNKKAGHEFEHDSTLSAPTNWALKKPERFDERGHTVQN-
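
Consistency check:

The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the transcript is a complete coding sequence read in frame from its first base under the standard genetic code, and the function name `translate` is illustrative. It marks the stop codon as `*`, whereas the protein entry above ends in `-`.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional T, C, A, G ordering of the three codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First nine codons and last four codons of DPOGS203270-TA, copied from above.
print(translate("ATGACTTTTGTAATTTTGGAAAAGCCC"))  # MTFVILEKP
print(translate("GTCCAGAACTAG"))                 # VQN*

# 915 bp = 305 codons: 304 amino acids plus one stop codon.
print(915 // 3 - 1)                              # 304
```

Both fragments agree with the start (MTFVILEKP) and end (VQN) of DPOGS203270-PA, and the lengths are consistent: 915 bp corresponds to 304 residues plus a stop codon.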