Monarch geneset OGS2.0

DPOGS206214
Transcript: DPOGS206214-TA, 1380 bp
Protein: DPOGS206214-PA, 459 aa
Genomic position: DPSCF300405 + 74806-81265
RNAseq coverage: 132x (Rank: top 56%)
Annotation
Heliconius: HMEL014441, 3e-113, 50.45%
Bombyx: BGIBMGA011989-TA, 3e-43, 46.11%
Drosophila: CG11247-PC, 7e-26, 29.34%
EBI UniRef50: UniRef50_H0WF54, 2e-28, 31.07%, Uncharacterized protein (Fragment) n=3 Tax=Danio rerio RepID=H0WF54_DANRE
NCBI RefSeq: XP_002085844.1, 1e-24, 29.34%, GD12094 [Drosophila simulans]
NCBI nr blastp: gi|326666684, 1e-28, 28.53%, PREDICTED: zinc finger protein 23-like [Danio rerio]
NCBI nr blastx: gi|326666684, 3e-34, 29.87%, PREDICTED: zinc finger protein 23-like [Danio rerio]
Group
Gene Ontology: GO:0003676, 5e-07, nucleic acid binding
KEGG pathway 
InterPro domain: [391-429] IPR013087, 5e-07, Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL25910, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206214-TA
ATGGAAGATTTATGCACGGCCTGTTTGTGCTCGGGCCGAAATTTGTTTAACATAAACGGGACAGAATTACAAGTAGTTTATTCTCAAATTTTGAATGAAATCTCTATTTCGGACACGCTGTTAGAGGTCTGCTTACGCGTGTGTTTCGAGTGCCGCGCGCTGTTGCGGCGTTATCACAAGTTTAAGAAACAAGTACAATGTTGCTACACGTCGTTACTGAAGCAAATAAAACAGGATACCCCCTCGGAGAAAATATCAAACGTGCCCCATTTGAAATCTGAGCCGATACACAACTTGAGCTATCCCGATGACATCGTGAAGGTGGAGTTCGAGGATGTGGACGTGAAGAATGAGGTCTCCGACAGGGAAGAGCTCCACTATGAGCCGGAGACGTTTGTTAAGGACCTGCAGCTGGCCAAGAAAGAGGCCAAGGACAAAAAGACGACGAAAAAGAAAACAAAACAGAAGATTAAATTGAAAACGAAGAACATGCAGGCGGGTGAGAGCGAAGTCGACGCGGATGTAAAGACGAGGGCGGACCTCTTCAGCTGCCAGGACTGTGGGAGGAACTTTGATGAGAAGCTGGACTTGGAGAAGCATATCATAGCGGACCATACGCTGAAGAAGGAGGTGAAGAATAACGGTGTGAAGGAAACCCTGCCGGAGAGACCGCAGTGCGTTGAGTGTGGAAAGGTTTTCAGTTCCCGGAAAACGTATAGATATCATCTGAACGTTCTCCACAAGGGCCAGAACAGGTATCCTTGTCCCAGATGCGGCAAGGTGTACCAGTGGAAGTCTAACCTGGGCAGGCATTTGAGGAGTCACAAGGCCCGGGACAGCGGCGAGTTACATTGTCACACCTGCGACAAGAGGTTCGCGTCCGTGGCCACCTACAGGCAGCACCTGCGCGTCTCTCGCTCACACGTCTCGGAGACTGACTTCACATTCATGTGCAACGAGTGCGGTAAGAAATTCGTCAACAAAACCAGGCTGAGAGACCACATCGACTGGGAGCACCTCAATAAAATTAAGTTCAAATGCAGCCTCTGCAATAAGCCGTTCAAGTGCCACACGTCCTTGTACGTCCACATGCAGAACGTGCACAGGAACAAGGAGAAGAAAGACAACCTGTGTCACGTCTGCGGGAAATCATACCAGAACGCCGCCAAGCTGCGCTACCACATCGTGGCCATGCACACGAGCGAGACGCCCTACAGCTGCGGCCAGTGCGGCGCGGCCTTCGGCTGGTACTCCTCGCTGTACCGACACATGCGCGAGGTGCACCACAAGATGAAAGTTCAGCCGAAGAAATCTAAGAAGTTGAAGAAATCCAGCGAGCTGGTGTCATCCTCACACGCGCATCAGGCCGGACCCGCTTAG

Protein sequence:

>DPOGS206214-PA
MEDLCTACLCSGRNLFNINGTELQVVYSQILNEISISDTLLEVCLRVCFECRALLRRYHKFKKQVQCCYTSLLKQIKQDTPSEKISNVPHLKSEPIHNLSYPDDIVKVEFEDVDVKNEVSDREELHYEPETFVKDLQLAKKEAKDKKTTKKKTKQKIKLKTKNMQAGESEVDADVKTRADLFSCQDCGRNFDEKLDLEKHIIADHTLKKEVKNNGVKETLPERPQCVECGKVFSSRKTYRYHLNVLHKGQNRYPCPRCGKVYQWKSNLGRHLRSHKARDSGELHCHTCDKRFASVATYRQHLRVSRSHVSETDFTFMCNECGKKFVNKTRLRDHIDWEHLNKIKFKCSLCNKPFKCHTSLYVHMQNVHRNKEKKDNLCHVCGKSYQNAAKLRYHIVAMHTSETPYSCGQCGAAFGWYSSLYRHMREVHHKMKVQPKKSKKLKKSSELVSSSHAHQAGPA-