Monarch geneset OGS2.0

DPOGS200508
Transcript         DPOGS200508-TA   1719 bp
Protein            DPOGS200508-PA   572 aa
Genomic position   DPSCF300450 - 2914-7937
RNAseq coverage    180x (Rank: top 49%)
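The transcript and protein lengths listed for DPOGS200508 (1719 bp and 572 aa) can be cross-checked with simple codon arithmetic. The sketch below assumes the transcript is coding sequence only, with a single stop codon and no UTRs; the function name `cds_length` is hypothetical and not part of any database tooling.

```python
def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode `protein_aa` residues.

    Assumes three nucleotides per codon and, optionally, one
    terminal stop codon. UTRs and introns are not modelled.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record for DPOGS200508.
transcript_bp = 1719
protein_aa = 572

# True when the transcript is exactly the coding sequence plus a stop codon.
print(cds_length(protein_aa) == transcript_bp)
```

Under that assumption the two figures agree: 572 codons plus one stop codon is 573 codons, or 1719 nucleotides.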
Annotation
Heliconius       HMEL007179         2e-67   36.36%
Bombyx           BGIBMGA001704-TA   1e-95   42.59%
Drosophila       CG5245-PA          5e-37   28.36%
EBI UniRef50     UniRef50_F7FPN5    8e-47   32.58%   Uncharacterized protein n=8 Tax=Theria RepID=F7FPN5_MONDO
NCBI RefSeq      XP_001944018.1     5e-41   30.98%   PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastp   gi|115528722       3e-47   29.80%   Zinc finger protein 616 [Homo sapiens]
NCBI nr blastx   gi|115528722       3e-56   29.76%   Zinc finger protein 616 [Homo sapiens]
Group
Gene Ontology     GO:0003676            1.9e-13   nucleic acid binding
KEGG pathway
InterPro domain   [501-527] IPR013087   1.9e-13   Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group   MCL34360              Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200508-TA
ATGAGTGATTTGTTGGCGTGCCGTGTGTGCCTGGCGTCTGAAGACGTGAAGCTTTACAGTCTCAATAAGTACAATTTGATGCAATCCTACGAGATGCTAACCGGAATACAGCTGACTTTAGATAATATAAAACACATCCACCTAGCACACCGTCACCCGTACAGGAGAGACGAGGAACATAGGCGATTCAACGTAGACTACACGACGGTCAAAAAAGAGACCACCATTAAAGAAGAATTGCCAGAGAAGAGGAAGAAGAGAAGAAAAAAGGAGAGTATAGAAGTTAAACTTGAAGATGAACCCCAAGACGATGGTTACGGCAATTACGAGAGCGATCAAGATATACCGGTCCTGGACTTGGATATACGTGGATTGGAGCTGCCAGATGGTCAGGAGATAGCGGGAGATAGTCAAGAGATAGCAGGGGATAGTCACGAGATAGCTGATAAAGATCTCAAGGATGTGGAGATAGTGCTGCTGAGTAAGGCGGAGCAGATACAGGAGATTGAAAAGAGGAAGACATCTTCCAACTACGTGAACTCCTACTACAAGTGTGAAAAATGCTTCAAGGGGTTTATCACTGAGCCTACCTACAGAAATCATATGGTCTGCCACGATCCTGAGCGCGGTCCACACACTTGCGACGTTTGCAACTCCCACTGGCCGTGTGCGCGCGCCCTTAGAGCACACTCGCTGAACACACACGAGAGGAAATACATGTGCAGGGTGTGCGATCATGTGTCGAGATCCAGCCATCGAGCCAAGGAGCACAGCAAGTGGCACAGCGGGTTCTCCTTCATATGCAAGACGTGCGGCGCCTCGTTCGCTAAATCCACGAGTTATCTGACACACATGCGGCTCCAGCATCCGTCGAGCAACTCGTGCGAGATCTGCGGCGAGTCGTTCGTGGGGGAGTTCGGGCTGAGGATGCACAAGAAGAAGTCACACGTCGGTACCACACGGATGCAAGCGGTCAGTATACCCACTAACCAGCTGCTAACATCTGCTCAACACCAGTTCCAGTATCCGGCTGTGAGTCAGTGCGAGCGCTGCCATCACAAGTTCGACTCCCGGGAGGCGCTGCTGCGACACGTCCAGCTGTCCGGGGACGCGTGCGACGCGGGCGAGGACACGCCACGGCCGTGTCCTCACTGTGGGGAGGGTTTCGACTCTGACGACAGTCTAAGGGACCACGTGGCGTCCCATGAGAAGGACACCGGCGTCACGTGTGAAGAGTGCAAACTAACGTTTTCGTCATCTAGTTCCTACACGATCCACTACCAGAGAGTACACCTCGGACTCAAGATGAAACAGAACAAGCCGCGCTGTTACAAGAAACCGGCCGACAGTCACGTCTGCGAGATGTGCGGGAAGAAGTGTATTACGAAGGCAACCCTGATGTATCACCAGCGGATCCACACCGGTGAGAGACCGTTCCAGTGTAGCGACTGCCCCAAGAAGTTCAGTGTGTACCAGAGACTCCAGATTCATCAACGTATTCACACGGGGGAGAGTCCTTACCAATGCAAGAGTTGTCCAAAGGCTTTCAAACACAAGGCAGCTCTCAACAGACATGATCGGGTCCATACGGGGGCAAAGCCCTACGGCTGTCCTCACTGCGGCAAGTCGTTTTCTCAGTCAAATTCTATGAAGCTGCACGTAAGCACCGTGCACCTTCGACTACCAGCGCCCTACAGGAACAGGACGAATAAAATATAA

Protein sequence:

>DPOGS200508-PA
MSDLLACRVCLASEDVKLYSLNKYNLMQSYEMLTGIQLTLDNIKHIHLAHRHPYRRDEEHRRFNVDYTTVKKETTIKEELPEKRKKRRKKESIEVKLEDEPQDDGYGNYESDQDIPVLDLDIRGLELPDGQEIAGDSQEIAGDSHEIADKDLKDVEIVLLSKAEQIQEIEKRKTSSNYVNSYYKCEKCFKGFITEPTYRNHMVCHDPERGPHTCDVCNSHWPCARALRAHSLNTHERKYMCRVCDHVSRSSHRAKEHSKWHSGFSFICKTCGASFAKSTSYLTHMRLQHPSSNSCEICGESFVGEFGLRMHKKKSHVGTTRMQAVSIPTNQLLTSAQHQFQYPAVSQCERCHHKFDSREALLRHVQLSGDACDAGEDTPRPCPHCGEGFDSDDSLRDHVASHEKDTGVTCEECKLTFSSSSSYTIHYQRVHLGLKMKQNKPRCYKKPADSHVCEMCGKKCITKATLMYHQRIHTGERPFQCSDCPKKFSVYQRLQIHQRIHTGESPYQCKSCPKAFKHKAALNRHDRVHTGAKPYGCPHCGKSFSQSNSMKLHVSTVHLRLPAPYRNRTNKI-
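
The record annotates this protein with a C2H2-type zinc finger domain (IPR013087). As a rough illustration of what such a motif looks like in the sequence, the sketch below scans for the simplified spacing pattern C-x(2,4)-C-x(12)-H-x(3,5)-H. This is an assumption made for illustration only: InterPro assigns domains with curated signature models, not with this regular expression, and the function name `find_c2h2` is hypothetical. Coordinates are relative to the string passed in, so the fragment used here will not reproduce the full-protein positions given in the record.

```python
import re

# Simplified C2H2 spacing: two cysteines 2-4 residues apart, a 12-residue
# linker, then two histidines 3-5 residues apart. Illustrative only.
C2H2 = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, motif) for each non-overlapping match.

    Coordinates are 1-based and inclusive, relative to `protein`.
    """
    return [(m.start() + 1, m.end(), m.group()) for m in C2H2.finditer(protein)]


# Fragment copied from the C-terminal region of DPOGS200508-PA.
fragment = (
    "RPFQCSDCPKKFSVYQRLQIHQRIHTGESPYQCKSCPKAFKHKAALNRHDRVH"
    "TGAKPYGCPHCGKSFSQSNSMKLHVSTVH"
)

for start, end, motif in find_c2h2(fragment):
    print(start, end, motif)
```

Passing the full protein sequence instead of the fragment would report candidate motifs along the whole protein; matches found this way are candidates only and should be confirmed against the InterPro annotation.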