Monarch geneset OGS2.0

DPOGS212849
TranscriptDPOGS212849-TA1095 bp
ProteinDPOGS212849-PA364 aa
Genomic positionDPSCF300086 + 227285-228491
RNAseq coverage212x (Rank: top 46%)
Annotation
HeliconiusHMEL0090651e-2233.54% 
BombyxBGIBMGA000792-TA6e-4540.59% 
DrosophilaMeics-PA8e-2432.30% 
EBI UniRef50UniRef50_E9IBG21e-3544.94%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9IBG2_SOLIN
NCBI RefSeqXP_001119865.11e-2640.25%PREDICTED: similar to zinc finger protein 585A [Apis mellifera]
NCBI nr blastpgi|3228013744e-3544.94%hypothetical protein SINV_01211 [Solenopsis invicta]
NCBI nr blastxgi|3228013745e-4029.22%hypothetical protein SINV_01211 [Solenopsis invicta]
Group
Gene OntologyGO:00036769.4e-09nucleic acid binding
KEGG pathway 
InterPro domain[242-269] IPR0130879.4e-09Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212849-TA
ATGTGTAAATATTGTAATAAAGAGTTTCCCGAGAAGAGTTACGGAGAGCACCTGGAGACGATACACTCGGGTAGTTTCTTCCGGTGTCACGAGTGCGACGGTTGCATCGACCGGAGCAGCTTCGTCGTTCACATGACGCTGCACGCTGTCGAATACGCCAAACATAAAGACAGGAAGACGAAAAGAATCAAACAGAATCTCGACCGAGGTCCGGCGAGGAACCAGAAATGTGATGCTATCGGGAGTGACGTCCCAGAGAAGGTGACAAAGAACGAACATCGAGTAGAGGGTGGTGATGAGGAATCAAGAAGTTATGAAGAGGTACACGAGGAATTCTGTGAACCGAGCAACGTGGAAGATTTTGGTCCCTTACCAGAATCGGTATTCGAAGCTATCGAAGACTCGAGAGAGTATCAAGAGGTAGAACAACATGTTCATCACAGCGACGCGGAGGCTCAGGACGATATACAAACATTAGAAATTAAAAATAGTCCACAATCCCCTGTGGCAGAACAGTCACAAAAACAGAGAACCAGATCTGATAATGCGGAAGTTCATAATGTGAAAAGCAAAATCAGAAAATGTCCGAAATGCGATAAAGTATACGTCGCCTCGTCGAGCTACTTCTACCATCTCAAATACTTCCACAACCAGAACAAGGAGCACGAGTGCGACGTCTGCGGGAAAAAGTTCGGCACCAAGGCCGGTCTAGCATCACACACGGCCATACACGGAGGAGACTGGCGGTACGCGTGCAGGGAGTGCGACAAGCGGTTCAGGACTAGGGCCAGTCTATACATACACCAACAGACGCACAGCGGAGTCAAATCGCACAGATGTTCACGATGCGGTAGATCCTTCAGGTGGCGCACGCACCTGCGGCGGCATGAGACGCGACACCTCGCCCAGAAGTCGCACGTCTGCGAGACCTGCGGCCGCGGGTTCAGTGTGCGCTGCGACCTGCTGCGACACGCGCGGACGCACGCGGCCGGCACGCACGTGTGCGACGTCTGCGGACACACCTTCGCTCAGCTCAGGTACCTCAGAGTCCACACGTGTTGGCCCGCGCTGCCCTCCGAGTCTGAAGAGACGTAA

Protein sequence:

>DPOGS212849-PA
MCKYCNKEFPEKSYGEHLETIHSGSFFRCHECDGCIDRSSFVVHMTLHAVEYAKHKDRKTKRIKQNLDRGPARNQKCDAIGSDVPEKVTKNEHRVEGGDEESRSYEEVHEEFCEPSNVEDFGPLPESVFEAIEDSREYQEVEQHVHHSDAEAQDDIQTLEIKNSPQSPVAEQSQKQRTRSDNAEVHNVKSKIRKCPKCDKVYVASSSYFYHLKYFHNQNKEHECDVCGKKFGTKAGLASHTAIHGGDWRYACRECDKRFRTRASLYIHQQTHSGVKSHRCSRCGRSFRWRTHLRRHETRHLAQKSHVCETCGRGFSVRCDLLRHARTHAAGTHVCDVCGHTFAQLRYLRVHTCWPALPSESEET-