Monarch geneset OGS2.0

DPOGS203802
TranscriptDPOGS203802-TA2466 bp
ProteinDPOGS203802-PA821 aa
Genomic positionDPSCF300010 + 1683436-1690142
RNAseq coverage29x (Rank: top 76%)
Annotation
HeliconiusHMEL0125000.084.03% 
BombyxBGIBMGA003705-TA0.074.96% 
DrosophilaOaz-PC3e-5339.35% 
EBI UniRef50UniRef50_D6W6767e-11031.36%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W676_TRICA
NCBI RefSeqXP_966615.11e-10731.19%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastpgi|3214776723e-8538.10%zinc finger protein [Daphnia pulex]
NCBI nr blastxgi|3214776727e-9038.53%zinc finger protein [Daphnia pulex]
Group
Gene OntologyGO:00036769.1e-10nucleic acid binding
KEGG pathway 
InterPro domain[42-69] IPR0130879.1e-10Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL12761 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203802-TA
ATGGCACATCCAAGAAGCCGATCTGACCCCGCAAGCGAGACGCATTCAGATCAAATGCCGTTTCGTTGCGAATTCTGCTCTCGTCTGTTTAAACACAAACGCTCCCGCGATCGCCACGTCAAACTTCACACAGGCGACAGAAAGTACAGATGTGTCCACTGCGAATCCGCTTTTTCCAGAAGTGACCATCTCAAGATTCATATGAAGACTCATGACAATCAGAAACCGTTCCAATGTACAGTTTGTAACCGGGGATACAATACAGCAGCAGCTCTCACATCACATATGCAGGGTCACAAGAGAGACCGAGAGGGTCGTGATAACGATCGAAGAAGAGCATTACGTTGTCTGCGCTGTGGAGATGCATTTAGAAGAGCTGATATGCTCCAGGCTCATATGATAAGTGCTCATGGAGTTGATGCGGAATCTATGACTCCGCCGCGTCGAGTTGCTTCACAGCCACCACCCACTTTACTAGCTTGCATATATTGCACTCGCGATACTTTTACCAGTATGGAACAGCTACAATTACACGTACGGACAGCCCATTCTGCATTATTAAATGGAGAAACACCAATTCAGTTCTCGGTAGACCAACCAGGACCGACAGACTTAAGTAGACGAGCTTCAGAAGATGTTTCTCCAGTAAAACGGCCAAGACTTAGCTCAGGTTCTACTACGCCTAAGAATGCTTTATCACCAAGTACTCTACTTTGTAATCAATGTGATGCAGCTCTGCCTGATTTCGAAGCTTTTAGGGCCCATTTGAAGGGTCATCTTGAAGAAGGAGGGGAACTTGGACGAACTAGTCCAAGTCCTTGTATCCATTGTGGAGCAACATTTGCAGACGTGGCAGCTTCTGAGCGCCACCTTGCAGCACATTATTTGGCTGTATCGTGTGAATATACATGTCACAGTTGTACGCGCAGTTTTCCTACTCCCGAAGATTTGCAGAAGCATCTCTTCGATTTACACACTCACCATATGTATAGATGTACATTCTGTAAAGAAATATTTGATTCCAAAGTTGCTATACAGGCTCATTTCGCGGTGGCACACAGTGGAGAGAACAAAGTGTGGGTTTGTCGATCATGTGGGGCCGCAGGTGGCCCGATGCGTTCAGAGGCGGAGGGGGCCGCCCACGTGCGAGCGAGGCATGCGGCAGCTCGCTGCTCCTGCGGAGCTGTACTGTCCGGTTCAAGAGCTCTACGAGCTCATGCTGCCTCACATCACGCATACCGCTGCCCTATCTCCACATGCAGCGACTCCTTTGCCGTGCAATATCTCCTAGAGCGACATATGCAAGTGCATCAAGCGATCACGCATCAGAATGTCAACGGTGAGAGAAACAAGCGAAATGAGAACAACAACGCCGTTGATGGCGATGGGGCGTGTTCTCCATGTATCTCCGGTGCGGATAATAATGGTCCTATTCCAACTGTTGGAGAAGAAAGACGACGAAAAAATGGTGCAGTAGCGCTACAATGTGCTTATTGTGGAGAAAGAACTCGTAGCCGAGCGGAGTTAGAGGCCCATACTCGAGCACATTCTGGTGCAGGCGCAGCGCGGCATAAATGTCTTATATGTGATGAAGTTCTACCCTCAGCTGCAGTGCTGGCTGAACATAAACTTACGCATTGCAAGGTGGTAGCTGGAGATACGTGTGCCCGATGTCGCTCACGTCTGCCTTCAGAAGAAGCGTTCTTGAGTCATATGGCACGACATCATCCTGCCTTACCAGCGCCGTGTGTTATATGTCGTCAAACATTGGCTTCGGAGGCTGAAGCACGTTTACATGCCCGTTTCCATTTACGGCCGTCTGCAGACGAACAAAGATGTGCGATATGTCTTAGAGCTTTATCCGAGAGTGAAAGCGGTGAAGGTGCAAGAGCATGTTCCACTTGCTATGCCAGACACGCAGCTCCTAGACCACAGACAACTACCGATCATGACTGTAGGTTATGTCGAAGAACATTGGGTTCCCCTACTCGTCTACAAGCTCACTTAATTGAACACACATTTGCTGGTATAGGTGCATTCACATGCTACCTGTGCTCCGCGATGTTTACGAGTGCTGCAGGCCTGCAAAGACATTTACCAGAACATGCTGCTGCTCCAAGACCTTACGATTGTGGCCGATGTGGTTTAAAATTCTTTTTTAGAGCCGAACTAGATAACCACGCTTTTGTACATCTTGAGGAAGCTGAAATTGCTCAAAGAGTTTTCTACGAGGCTTATGCACGAGGAGCAGCAACCGCCTGGGCTGCTTTGCAGCCAACTGATATACAACCACAATCATCAGCTTCAACACCAGCCCCTACAATAAGTGACGTTAAACAAGAACCAGAAATAAAGGAGGAACGTAACGATGAGTATATAGAAGTATCATCGCCGCCTCCGAAACAATCTGAGCCAGCAACGTCACCTACTCCAATGATAAAACAAGAGAAAACTGACGAAGACTGA

Protein sequence:

>DPOGS203802-PA
MAHPRSRSDPASETHSDQMPFRCEFCSRLFKHKRSRDRHVKLHTGDRKYRCVHCESAFSRSDHLKIHMKTHDNQKPFQCTVCNRGYNTAAALTSHMQGHKRDREGRDNDRRRALRCLRCGDAFRRADMLQAHMISAHGVDAESMTPPRRVASQPPPTLLACIYCTRDTFTSMEQLQLHVRTAHSALLNGETPIQFSVDQPGPTDLSRRASEDVSPVKRPRLSSGSTTPKNALSPSTLLCNQCDAALPDFEAFRAHLKGHLEEGGELGRTSPSPCIHCGATFADVAASERHLAAHYLAVSCEYTCHSCTRSFPTPEDLQKHLFDLHTHHMYRCTFCKEIFDSKVAIQAHFAVAHSGENKVWVCRSCGAAGGPMRSEAEGAAHVRARHAAARCSCGAVLSGSRALRAHAASHHAYRCPISTCSDSFAVQYLLERHMQVHQAITHQNVNGERNKRNENNNAVDGDGACSPCISGADNNGPIPTVGEERRRKNGAVALQCAYCGERTRSRAELEAHTRAHSGAGAARHKCLICDEVLPSAAVLAEHKLTHCKVVAGDTCARCRSRLPSEEAFLSHMARHHPALPAPCVICRQTLASEAEARLHARFHLRPSADEQRCAICLRALSESESGEGARACSTCYARHAAPRPQTTTDHDCRLCRRTLGSPTRLQAHLIEHTFAGIGAFTCYLCSAMFTSAAGLQRHLPEHAAAPRPYDCGRCGLKFFFRAELDNHAFVHLEEAEIAQRVFYEAYARGAATAWAALQPTDIQPQSSASTPAPTISDVKQEPEIKEERNDEYIEVSSPPPKQSEPATSPTPMIKQEKTDED-