Monarch geneset OGS2.0

DPOGS205367
TranscriptDPOGS205367-TA1506 bp
ProteinDPOGS205367-PA501 aa
Genomic positionDPSCF300373 - 125155-131784
RNAseq coverage620x (Rank: top 21%)
Annotation
HeliconiusHMEL0134450.082.77% 
BombyxBGIBMGA008759-TA5e-6263.83% 
DrosophilaCG6654-PA7e-1224.09% 
EBI UniRef50UniRef50_UPI00022B1FE61e-1528.38%UPI00022B1FE6 related cluster n=1 Tax=unknown RepID=UPI00022B1FE6
NCBI RefSeqXP_973104.11e-1322.59%PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum]
NCBI nr blastpgi|3485126014e-1528.38%PREDICTED: zinc finger protein 850-like [Oreochromis niloticus]
NCBI nr blastxgi|2607884611e-1923.76%hypothetical protein BRAFLDRAFT_242600 [Branchiostoma floridae]
Group
Gene OntologyGO:00056342.5e-08nucleus
GO:00082702.5e-08zinc ion binding
KEGG pathway 
InterPro domain[14-93] IPR0129342.5e-08Zinc finger, AD-type
Orthology groupMCL18340 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205367-TA
ATGGAAGCTAAAACAACTGAATGGCGACCGGGGCCAACTGTGTGCAGATGTTGCCTCGCTGAAGGATGTTATAAGGACATTTCGACGGAATATTTTTGGATGGGAAAGCGAGAAGTTTATGCGGAAATGCTATTGGAAACTTTCGATTTGAATATTTCGTACTCACAATCTGGTGGGCCGAACAGCAACAGTCGTCTGATCTGCGAGCCTTGCATCTCTAGATTAAGGGATGCAGCCGACTTCAAGCGTCAAGTCACTGAGTGCGAGCGGAGCTTCACACAGTACTTGGACCCAGTGGCTACTGATGTTGACGTGGACTTGGGACTGGAGAAGGAAGTGAAGATCGAGCGTGTCAAGGATGAGAAGTCGATGAGTGATGATGAGTTCGTTGCACCGGAGTTTGTCGATGATGATGATGATGATGATGATCTTGATGACCAACCCTTAACGAAATTAGCCAGCAAAATACCCAAAAAGGAGTCTGTGGATGTACTAGATCTGTTGGACAATGCCAAAGTCACTGAAAAGAGGAAATCTTCTAGCAAAACGAAATCATCACCGGCCAAAAAGTCTAAACTGAAAAAGGAGGTCGCTAGTTCTAGCAAAGTGAAACCGGAACCGAGAAAAAAGAAAGAGGAATGGTCGGATCCCGCGAGAGGCAACGCCGGTACCATAATAAAATGTACAACCGCTTACCCCTTCAGAGTGAATGATAAGTGCATTGTGTGCGTTTACTGCCAGGAACTGTACGATGATCCGGAACTGTTCAGGCGGCATATGGATGATGAGCATCCGACGTTCAACGTCAAAGTAGCCTTCCATAACATACCGAAATTAGAGTTCATTAAGGCTGACCTCACGGAGTTGAGATGTAGGCTGTGCGATGATCGCTTCGACAGCTTAGAAGTCATAGCGGAACATCTGAAGAGTAATCACAATGAGGCTATAGATCTGAACGCCAGACTGGGCGTGATGCCATACGTTCTGAGAAAGGATTTCTTTAATTGCGTCGTCTGCGGCAAGAACTGCCCTTCGTTGTTCCATCTAAACAGGCATACGATAACCCATTTCCTGAGTTATGTCTGTCATTTTTGTGGTAAAAGCTATGTTGCGACAACAGGCCTGCTTCGTCACGTGCGCTCGAAGCATCAGGAGTACAAGGTGTCGTGCAGGCGATGCGGCAAGGTCTTTCCTAATATGGAGGCCAAGGAGAGGCATAGGAGAACGGAGAAGTCTTGCATGGCGTACTGCTGTTCTAAATGCCCGGAGAGGTTCCACGATTGGAAGCTTCGTAAGCGCCACATGGAGACCGAGCACGGTCAGAGCAAACGTATAGATCGCTGTGCCGACTGCAATATAACATTCAGCAAGGGTAGCGCTTACTACCAGCATTTCAAACTAAAACATTCGAAAGATTGCGTCGTTTGCAAACATTGCGGCATGAAATTCATTTGTCGTTCCCGTTACAAGAGACATCTGTCCACTCACGTCAGCTACACGGCCTAG

Protein sequence:

>DPOGS205367-PA
MEAKTTEWRPGPTVCRCCLAEGCYKDISTEYFWMGKREVYAEMLLETFDLNISYSQSGGPNSNSRLICEPCISRLRDAADFKRQVTECERSFTQYLDPVATDVDVDLGLEKEVKIERVKDEKSMSDDEFVAPEFVDDDDDDDDLDDQPLTKLASKIPKKESVDVLDLLDNAKVTEKRKSSSKTKSSPAKKSKLKKEVASSSKVKPEPRKKKEEWSDPARGNAGTIIKCTTAYPFRVNDKCIVCVYCQELYDDPELFRRHMDDEHPTFNVKVAFHNIPKLEFIKADLTELRCRLCDDRFDSLEVIAEHLKSNHNEAIDLNARLGVMPYVLRKDFFNCVVCGKNCPSLFHLNRHTITHFLSYVCHFCGKSYVATTGLLRHVRSKHQEYKVSCRRCGKVFPNMEAKERHRRTEKSCMAYCCSKCPERFHDWKLRKRHMETEHGQSKRIDRCADCNITFSKGSAYYQHFKLKHSKDCVVCKHCGMKFICRSRYKRHLSTHVSYTA-