Monarch geneset OGS2.0

DPOGS211454
Transcript        DPOGS211454-TA   1308 bp
Protein           DPOGS211454-PA   435 aa
Genomic position  DPSCF300223 + 98493-105507
RNAseq coverage   161x (Rank: top 52%)
Annotation
Heliconius       HMEL007455        3e-103  66.99%
Bombyx           BGIBMGA002159-TA  7e-66   51.99%
Drosophila       pnr-PA            6e-63   61.46%
EBI UniRef50     UniRef50_D6WKR9   3e-66   45.14%  Pannier n=2 Tax=Tribolium castaneum RepID=D6WKR9_TRICA
NCBI RefSeq      XP_973051.1       6e-67   46.09%  PREDICTED: similar to AGAP002235-PA [Tribolium castaneum]
NCBI nr blastp   gi|91094885       1e-65   46.09%  PREDICTED: similar to AGAP002235-PA [Tribolium castaneum]
NCBI nr blastx   gi|91094885       6e-70   47.49%  PREDICTED: similar to AGAP002235-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0006355  2.4e-25  regulation of transcription, DNA-dependent
                 GO:0008270  2.4e-25  zinc ion binding
                 GO:0003700  2.4e-25  sequence-specific DNA binding transcription factor activity
                 GO:0043565  1.3e-21  sequence-specific DNA binding
KEGG pathway 
InterPro domain  [211-268]  IPR013088  2.4e-25  Zinc finger, NHR/GATA-type
                 [210-260]  IPR000679  1.3e-21  Zinc finger, GATA-type
Orthology group  MCL26748  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211454-TA
ATGTTCGGCAGCAGCGGCAGTGGCGGCTACGGGGATGGGGAGGGCTACGGACTGGGCTACGTCGGAGGGGGAAGGCTCCAACAGTACGCGCACTTCGCCCCTCATCAGACATGGCATCATCACCCCGCTCACCACGACACCTACGGTACCAGACCCGCGCCGCGGTGCTCCGATCAGTCCGCGGCGACCTCTGACCGCGACGGTCGGTGTGACCTGCCACCTCACCTGCCACCTCTCGACATGGATCTGCAGTTAACTCAGTCTTGGTTACAAAACATAGCCCTCATCGCAGGAGGGAGCGGTGTGGGCAGCGTGGGCACCGTGGGCGGCGTCGGTAGTGTGGGCGGTGTGGGTGGCGTGGGCGGTGTGGGCGGTCTGTACCCCCAGAACATGGTGATGGGATCTTGGTGCGGGCCCTACGATGCCCTCCAAAGACCTCCAGCTTACGATGGAGTGCTGGAGGCGTACGAGGAGGGTCGCGAGTGTGTGAACTGCGGCGCCAACAACACGCCGCTGTGGCGCCGCGACTCCACCGGCCACTACCTATGCAACGCGTGCGGTCTCTACCACAAGATCAACGGAGTGAACCGGCCGCTCGTGAAGCCGAGCAAGCGGCTGTCAGCGGCTCGTCGACACGGTCAAAGCTGCACCAACTGTGGCTCCAGGAACACGACCCTCTGGAGGAGGAACAACGAGGGCGAGCCCGTCTGTAACGCGTGTGGACTCTACTACAAGCTCCATGGAATCAATAGACCTCTGGCTATGAGGAAAGATGGAATACAAACCAGGAAACGTAAGCCGAAGAAATCGGCGAACGGCGTGAAGCCTTCACCGGAGACGACTAAGAAGGACGAGCAGACGTCACCGGGAGTGGACGAGAGCAAGCCCAGTATACCAGAGGTTCCGTTGCCTCTGACAGCTACCCTATCGGGGCACTCCTCGAGCAAGTCCCGCGAGCCTCTCGGTGCTACGCCCTCGCCGCATGCACACAGTCACTCGCACTCTCACGCACACTCCCACACACACTCCCACACACACTCGCAGAACTCTCAGTATCCCCTGGCGCTGCCCTCTGCCCCCGCCTTCCTCTCCAACCCTTCGCTGTTCAACATCAAGAGCGAACCGAACGCAGCCTCCGGCTACGAGGGTTACGGCTCGCACGCCGCCTCTAACGGCCCGTATCACTCTCAACAGCACTACCTGCACGCCTTACAGTACGGCCTGGGCGGTGCTGAGGAGGAGGAGGGCAGCGGGTTCCTTCATCAGCGGAACGTGACCGCACACGCCAAGCTCATGGCCTCCACGTAG

Protein sequence:

>DPOGS211454-PA
MFGSSGSGGYGDGEGYGLGYVGGGRLQQYAHFAPHQTWHHHPAHHDTYGTRPAPRCSDQSAATSDRDGRCDLPPHLPPLDMDLQLTQSWLQNIALIAGGSGVGSVGTVGGVGSVGGVGGVGGVGGLYPQNMVMGSWCGPYDALQRPPAYDGVLEAYEEGRECVNCGANNTPLWRRDSTGHYLCNACGLYHKINGVNRPLVKPSKRLSAARRHGQSCTNCGSRNTTLWRRNNEGEPVCNACGLYYKLHGINRPLAMRKDGIQTRKRKPKKSANGVKPSPETTKKDEQTSPGVDESKPSIPEVPLPLTATLSGHSSSKSREPLGATPSPHAHSHSHSHAHSHTHSHTHSQNSQYPLALPSAPAFLSNPSLFNIKSEPNAASGYEGYGSHAASNGPYHSQQHYLHALQYGLGGAEEEEGSGFLHQRNVTAHAKLMAST-
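
Consistency check (not part of the original record): the listed lengths, 1308 bp for the transcript and 435 aa for the protein, agree if the nucleotide sequence is a complete coding sequence whose final codon is a stop. The following is a minimal sketch under those assumptions, using the standard genetic code and writing the stop codon as "-" as the protein sequence above does. The helper names `translate` and `expected_protein_length` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Stop codons are written as "-" to match the record's protein sequence.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY--CC-W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon (assumes frame 0)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon."""
    return cds_length_bp // 3 - 1


# First four codons and last three codons of DPOGS211454-TA.
print(translate("ATGTTCGGCAGC"))
print(translate("TCCACGTAG"))
print(expected_protein_length(1308))
```

Applied to the full DPOGS211454-TA sequence, `translate` would be expected to reproduce the DPOGS211454-PA sequence including its trailing "-", provided the assumptions above hold.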