Monarch geneset OGS2.0

DPOGS200305
TranscriptDPOGS200305-TA1407 bp
ProteinDPOGS200305-PA468 aa
Genomic positionDPSCF300026 - 242445-245385
RNAseq coverage56x (Rank: top 69%)
Annotation
HeliconiusHMEL0020410.079.38% 
BombyxBGIBMGA005627-TA1e-16772.75% 
DrosophilaCG7368-PB9e-7053.33% 
EBI UniRef50UniRef50_E0VTC11e-8244.86%Putative uncharacterized protein n=8 Tax=Neoptera RepID=E0VTC1_PEDHC
NCBI RefSeqXP_002429365.12e-8344.86%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420177814e-8244.86%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|1892382236e-9345.88%PREDICTED: similar to CG7368 CG7368-PB [Tribolium castaneum]
Group
Gene OntologyGO:00036761.2e-09nucleic acid binding
KEGG pathway 
InterPro domain[378-407] IPR0130871.2e-09Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL16947 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200305-TA
ATGAACTTCGCTCCCTTCTCCGGTCACACGCCGGCCGCGTCCCAGATCCCGACGATCCACCAGTTCGCAGCGAAGTTCGGTTATGAAGGCAGCGGTAGCACAAGCAGCAGTGTGGTGCAAGAAACGAGATACCAGGCCACCGGGAGTAATGTCCAGTTCGCTGTCACGCCGTCGAGTGGAGTCACTGTGGACGGCGTGAAGTTCAGACAGGCCGAGAGTGTTGTAGTTGTCAGCAACTCTCAGCCAGGATCTGTTACACTAGCTGGTATACCGCCCATACATTCTGTCAATGCGGGTGTGAAATACCAGAGTATAACAGCAACTTCGTCAACTGTGAACGTTGGTAACATCGCGTATGGAGGGACTGGTCAAGTGGGGTACGTTCAGGAGACGGTGTTAAAGGGGGAGCAGAGGGACAAGAGAGAGTACAGAGAGGAACACCATGATATCATGACTTCTATGCTGCAGCAACATCAAGTGAGTCAAGCGCCTCTGCAGCAGGGCACACAGCAATTGGTGGTAGTCCTGCACGAGCCCAAGGAGGGCGCAGCACTAGCTCGGGCCATCAAGCAGGAGGGACGGAACCAACGTGAATTCTCCAATGTACCCACGGAGTTTATTAGTATAGGTGAAGTGGAAGGGACATGGCTGGGAGGCGGCGTGGCGGACTACCTCTCAGCACTGCCGTTACCTCTGCACCATCTGCTGAAGTACTCCGACAAACGCGAGGAACCAACCATAGTCACCGTAACGAATTCACTCCCTCCTATGTCTACTCTTACTTCGGTGCCAACTAACGTCGTGAACAGCGTGGTCGTGCAAACAGCTACGATTGTGAGTCCTGTTATAAATGCCACAGTTAGTTCCAACATCAATGTGAATGCCATTACGATGGCCAAGAAAAAGAAGAAGAAGAAAGCTCTCAAGGAGAAGAAGCCGAGACCAAAACCCGGTGAAATACGATTGACGACTGCTCTAGATGGAAGTACTCTTTATTGCTGCCCGGAGTGTCATATGGCTTATCCTGAACGCGGTCTTCTGGATCAACATCTCGTGGGCCACACGATGGAAAGAAGATTTATTTGCGACATATGCAACGCGGCACTCAAACGTAAGGATCATCTGACGAGACACAAGCAGTCTCACAACCCTGAACGACCTCACGTTTGCTCCGTGTGTCTGAAGGCCTTCAAACGAAAGGAACAGCTCACGCTTCACTTTATAATACACTCGGGAGAGAAGAGACACGTGTGCAACGAGTGTGGGAAAGGTTTCTATCGCAAGGATCATCTCCGCAAGCACACACGCTCGCACATCGCGCGACGCGTCAAAGCCGAACTTTCTCAACAGTCGAACACGCCACCGCTCTCACAGACGGTACCAGCCCCGGCGTTAGAGCCCTCTTGA

Protein sequence:

>DPOGS200305-PA
MNFAPFSGHTPAASQIPTIHQFAAKFGYEGSGSTSSSVVQETRYQATGSNVQFAVTPSSGVTVDGVKFRQAESVVVVSNSQPGSVTLAGIPPIHSVNAGVKYQSITATSSTVNVGNIAYGGTGQVGYVQETVLKGEQRDKREYREEHHDIMTSMLQQHQVSQAPLQQGTQQLVVVLHEPKEGAALARAIKQEGRNQREFSNVPTEFISIGEVEGTWLGGGVADYLSALPLPLHHLLKYSDKREEPTIVTVTNSLPPMSTLTSVPTNVVNSVVVQTATIVSPVINATVSSNINVNAITMAKKKKKKKALKEKKPRPKPGEIRLTTALDGSTLYCCPECHMAYPERGLLDQHLVGHTMERRFICDICNAALKRKDHLTRHKQSHNPERPHVCSVCLKAFKRKEQLTLHFIIHSGEKRHVCNECGKGFYRKDHLRKHTRSHIARRVKAELSQQSNTPPLSQTVPAPALEPS-