Monarch geneset OGS2.0

DPOGS208435
TranscriptDPOGS208435-TA1044 bp
ProteinDPOGS208435-PA347 aa
Genomic positionDPSCF300095 + 117343-119260
RNAseq coverage41x (Rank: top 72%)
Annotation
HeliconiusHMEL0072252e-15487.11% 
BombyxBGIBMGA009541-TA7e-13980.22% 
Drosophilaodd-PA1e-6640.29% 
EBI UniRef50UniRef50_E0VDT09e-7349.71%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VDT0_PEDHC
NCBI RefSeqXP_002424274.12e-7349.71%hypothetical protein Phum_PHUM125070 [Pediculus humanus corporis]
NCBI nr blastpgi|2420068883e-7249.71%hypothetical protein Phum_PHUM125070 [Pediculus humanus corporis]
NCBI nr blastxgi|2420068885e-7449.42%hypothetical protein Phum_PHUM125070 [Pediculus humanus corporis]
Group
Gene OntologyGO:00036769.5e-12nucleic acid binding
GO:00082706.6e-06zinc ion binding
GO:00056226.6e-06intracellular
KEGG pathway 
InterPro domain[201-228] IPR0130879.5e-12Zinc finger, C2H2-type/integrase, DNA-binding
[258-280] IPR0070876.6e-06Zinc finger, C2H2
Orthology groupMCL14732 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208435-TA
ATGGAAGATAAGCCGTACAGTGTTGACAGTTCCGACGATATCAATATCGAGGAAGAGGAAGACGAACACATCGAACAAAATCCTAGTTATCCGCCAGATTTTCCAATTGAGGCTTTAGTGAAGTTAGAAAGGCCATCACCTCCTTTACCGACGCCTCCTCACACTCCGACTGACGACACTCCACCGGCAATTGTCGTATCACGACACCAAGCTGCCTGGTCACCTCAATATCCCATCCAAAATCGAACAATGCAGTCGCCTTATCCTTACGACAAGCAAATTCCTAATTATACATACCCGACAGTTCCTGGTGTACCAGCTGTACCTTACGTCCAACCCCAAGTTGCAGCACTCCCTCCGTCAGCACCATACCACGCGATGCTGGTAAATCAGTGGATAAGGAACGCGGCTTTGTATCATCACTCATTAAGATATCCCTCAATGGCGCAAAGAATACCTGGAAGAAATCCGCCTCAAATGAGAGCACCCAGTGTTCCCGGTACGCGGCCGAAGAAGCAGTTCATCTGCAAATACTGCAATCGCCAATTTACAAAGTCCTATAACTTATTGATACATGAAAGAACGCACACGGATGAAAGGCCTTACTCATGTGATATTTGCGGGAAGGCTTTCAGAAGACAGGATCATCTCAGAGATCACAGGTATATTCATTCTAAGGAGAAGCCTTTTAAGTGCACTGAATGCGGGAAAGGTTTCTGTCAATCAAGAACATTGGCTGTTCATAAAATTTTACATATGGAAGAATCTCCACATAAGTGTCCTGTTTGCAACAAAAGTTTTAACCAAAGATCGAATTTGAAGACTCACTTATTGACTCACAGCGATGCAAACAAGCATCACTTGGAGCACTTGGAAGGCTGTCAAGAAGTATCTTCAACAAGCCAAGTCCCCGAGTCTCCTTTGTTAGATCTTTCACACAAACCATCGCCACCCCCTGTACACTACCCGCTCAATCCCCAAGAATCCCCCGTCGAACCAAAAAGGCCTTTAGGATTTTCAATAGAAGATATTATGAAGCGTTAA

Protein sequence:

>DPOGS208435-PA
MEDKPYSVDSSDDINIEEEEDEHIEQNPSYPPDFPIEALVKLERPSPPLPTPPHTPTDDTPPAIVVSRHQAAWSPQYPIQNRTMQSPYPYDKQIPNYTYPTVPGVPAVPYVQPQVAALPPSAPYHAMLVNQWIRNAALYHHSLRYPSMAQRIPGRNPPQMRAPSVPGTRPKKQFICKYCNRQFTKSYNLLIHERTHTDERPYSCDICGKAFRRQDHLRDHRYIHSKEKPFKCTECGKGFCQSRTLAVHKILHMEESPHKCPVCNKSFNQRSNLKTHLLTHSDANKHHLEHLEGCQEVSSTSQVPESPLLDLSHKPSPPPVHYPLNPQESPVEPKRPLGFSIEDIMKR-