Monarch geneset OGS2.0

DPOGS215557
TranscriptDPOGS215557-TA2298 bp
ProteinDPOGS215557-PA765 aa
Genomic positionDPSCF300129 + 705644-711588
RNAseq coverage191x (Rank: top 48%)
Annotation
HeliconiusHMEL0061699e-10939.25% 
BombyxBGIBMGA010686-TA2e-14839.46% 
DrosophilaCG5245-PA2e-2123.49% 
EBI UniRef50UniRef50_UPI00005889CB9e-2726.21%UPI00005889CB related cluster n=1 Tax=unknown RepID=UPI00005889CB
NCBI RefSeqXP_001187675.12e-2726.21%PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus]
NCBI nr blastpgi|2607888466e-3227.55%hypothetical protein BRAFLDRAFT_280890 [Branchiostoma floridae]
NCBI nr blastxgi|2607888465e-4527.55%hypothetical protein BRAFLDRAFT_280890 [Branchiostoma floridae]
Group
Gene OntologyGO:00036764.1e-13nucleic acid binding
GO:00056343.1e-07nucleus
GO:00082703.1e-07zinc ion binding
KEGG pathway 
InterPro domain[653-680] IPR0130874.1e-13Zinc finger, C2H2-type/integrase, DNA-binding
[7-74] IPR0129343.1e-07Zinc finger, AD-type
Orthology groupMCL25759 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215557-TA
ATGTTCGACCTTAAGGCATGTGTCGTGTGTTTAAAAGCCGATGTAAAGTTATTTAGCATGAATAATGGTCAACTTAGACAACAATTCTACTTAGTAGCTGGGCTTAAGACAATTTCTGGAACTGGTCTACCAGAATATTTATGTTATCAATGTAAGGGATATGTAAGAAGTTTTATAAGATTTCGTGAAAAATGCCAAAGAACATTTTATGTGTTACAAGAAATTTTAAGAACAAACAATGAGATAACACAATCTGTATTAAATGAATTGGATTATAGTATTCTAAACATTAAGCCAGGCTTAGCTTACATGGACTCTATGAAAGGCGGTCATCAGTATGAGGACATAAAATTCAAGTGGTTGAAACAGAATCGCACATGCATAGAACCTCAAGGCACTATTCCCGTTATGTTATGTAGTACGACAGAAAATATATTTGAATTGCCGTTCAAAACGAAATCGAATTGTGATGATGGGAAGAATACATATGATATGGCTGTTGGATTAAGCAATAATGTTGAGAAGAATGAAGTAGAATTGTTTGATTCCAATATAAATATAAGTAATAATGATTTAGAAGTATTAGAATTGCAGAAAGATGATGCCGACGGTTCCATCCTCAATGAGGAATATGGTAATGTAATCCCAATCAGTCTGAAAGAGGCTCAGGCTGTTGTAGATATTAATAAAAAATTTGCACTCGGAAAGTTTCGTTGTGATATCTGTGATAAGGCGTACTGTAATGAAAAAAATTTAGAATTACATAAGAGGATGCATGTCGAGAGCGTAAGTGGTTCACATTACTGTGTGCTGTGTAAATATTATTATAAGACAGAATTTTTACTGAAAACCCATTATAAAGACAAACATATGTATAAATATTTATGTAGGAATTGTCCTGAAGTTAGTTTTGACAGATTTTCAGCAAAAAGACATTTCATGTTGATACATGGTCCGAAAGGTACAAAGAAAGATGGTGTCACCGACAAACTAAACAAGAAGAACATAGACAAGAAAAAACAAGGGATCTATGTTCATAAGAAGATTAAACCCAAAGATCCGGAAGATTTCCTCATATATACACCGATAAAACAAGCGGAACAATACTCTATGGTGCTAGACAGGCAGAAAACAAAGAATTATATAGAATCTCCGTACAAATGTCAGTATTGCTTCAAAGGTTTCAGGGAAGTGGTCACATATGAGAAGCACATGCAGAAACATGACCCTGTGTATTCCGGTAAATATCAGTGCGACATGTGTAAAATACACTGTTCGAGCACGAGGAAGATGTACAAACATATGAACACTACGCATAACTTCAAATTTTCCTGTCAAATGTGCAGTTTCGTGTGTTACAGCAGAGGTCAAGCAAGATCACATTATCAATGGCACAAAAACGTTACTTACTCCTGCCCGCACTGTACCAAAGTTTTTACGAAGCAGTCGACTCGTTTGACCCACATTCGTATAAAACATCCGTCTACTTATATCTGTAATATATGCGGACACAGTTACGTGAGCGAGGCTGGTTTATACTGTCACAAGAAGATAGCACACAGCGCTGAGGAGATAAAAGTCCAAGAGATGCCGACTCCGTCCCTATCCCTGTACTGTTCTGAGTGTGAAGTACAGTTTACCAATCAAAAAGCCTACGACACACATTTCGGATCATCGAACAAACACGCAGATACTAACGTATCAACTAAACCGTCTCGTAGTAATAAGTGCAGTCCGTCGCGACCTCGCGGCCGGCCTCGGTCGGGGTCCGATGTCCTCAACACCGGGGTCACGACCGCTTCGCACTGCGAGATATGCCAGCAATACTTACCAAACGACGTCCAAGCGAAGCGACACTACGAATCCGAACATCCGGGGGCGACTTACCTCAAGAGATACATGTGTGATATATGCGGACATACAACTAAGCAATACGCGAACCTGTTGGTACACATGCGGACGCACACACAGGAAAAGCCGTATTCGTGTCCTCACTGTCAAAGGAGATTTAGTATGGTCAGCAACAGAGACAGACATCTGGTGGTACACACAGGTGAAAAGAGATATCAATGCCAGCATTGTAACCGTCGCTTCACACAGAGCAGTGCCGTCAAGCTTCACATACAGACTGTCCATCTGAAGATACCTTATGCTCCGTGGAATAAGAAGAACCGGAAACGACGTCGCGACGAGCCCGCTCCCCCCTCACCTACACCGCCCCAACCTCCTCAGGCCCCCCACAAGCTGGTGTTAGACGCTGGGAATTACCTCAGCGCCTATATAACATATAATGAATAG

Protein sequence:

>DPOGS215557-PA
MFDLKACVVCLKADVKLFSMNNGQLRQQFYLVAGLKTISGTGLPEYLCYQCKGYVRSFIRFREKCQRTFYVLQEILRTNNEITQSVLNELDYSILNIKPGLAYMDSMKGGHQYEDIKFKWLKQNRTCIEPQGTIPVMLCSTTENIFELPFKTKSNCDDGKNTYDMAVGLSNNVEKNEVELFDSNINISNNDLEVLELQKDDADGSILNEEYGNVIPISLKEAQAVVDINKKFALGKFRCDICDKAYCNEKNLELHKRMHVESVSGSHYCVLCKYYYKTEFLLKTHYKDKHMYKYLCRNCPEVSFDRFSAKRHFMLIHGPKGTKKDGVTDKLNKKNIDKKKQGIYVHKKIKPKDPEDFLIYTPIKQAEQYSMVLDRQKTKNYIESPYKCQYCFKGFREVVTYEKHMQKHDPVYSGKYQCDMCKIHCSSTRKMYKHMNTTHNFKFSCQMCSFVCYSRGQARSHYQWHKNVTYSCPHCTKVFTKQSTRLTHIRIKHPSTYICNICGHSYVSEAGLYCHKKIAHSAEEIKVQEMPTPSLSLYCSECEVQFTNQKAYDTHFGSSNKHADTNVSTKPSRSNKCSPSRPRGRPRSGSDVLNTGVTTASHCEICQQYLPNDVQAKRHYESEHPGATYLKRYMCDICGHTTKQYANLLVHMRTHTQEKPYSCPHCQRRFSMVSNRDRHLVVHTGEKRYQCQHCNRRFTQSSAVKLHIQTVHLKIPYAPWNKKNRKRRRDEPAPPSPTPPQPPQAPHKLVLDAGNYLSAYITYNE-