Monarch geneset OGS2.0

DPOGS205427
TranscriptDPOGS205427-TA2046 bp
ProteinDPOGS205427-PA681 aa
Genomic positionDPSCF300504 + 14431-27310
RNAseq coverage62x (Rank: top 68%)
Annotation
HeliconiusHMEL0115330.066.44% 
BombyxBGIBMGA001744-TA2e-11950.12% 
DrosophilaCG5245-PA6e-2427.08% 
EBI UniRef50UniRef50_UPI00022B3AB43e-2931.08%UPI00022B3AB4 related cluster n=1 Tax=unknown RepID=UPI00022B3AB4
NCBI RefSeqXP_002427629.17e-2828.95%gonadotropin inducible transcription factor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3343289141e-2826.34%PREDICTED: zinc finger protein 420-like [Monodelphis domestica]
NCBI nr blastxgi|2926214814e-3429.25%PREDICTED: zinc finger protein 226-like [Danio rerio]
Group
Gene OntologyGO:00036761.6e-13nucleic acid binding
GO:00056344.9e-12nucleus
GO:00082704.9e-12zinc ion binding
GO:00056221.9e-05intracellular
KEGG pathway 
InterPro domain[604-629] IPR0130871.6e-13Zinc finger, C2H2-type/integrase, DNA-binding
[89-157] IPR0129344.9e-12Zinc finger, AD-type
[17-71] IPR0066122.6e-09Zinc finger, C2CH-type
Orthology groupMCL19952 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205427-TA
ATGATCACAGAGAATTGTGTTAGCTGCATGCCATTAGCTTTTGTCAGAAGATTCCCCAAAGAGGATATCCTCCGCACCGCATGGATTGATGCTCTCGGTATGACAGAATGGGAATCCAAAGATCCAAGCATGGTGTGCTCTGAACATTTTACTGAAGATTCATTCTACGTCACAAAATGTGGCGTAAAAAAGGTCAAAGCCGATGCAGTGCCTTTACCGGTCTTTATAACATCGGATGAATTAGACTCGGCAGCGGATAAAAGGGTGTGCAGAGTATGTTTGTCGATAGATGTCAAAATGTATGACATAACAGAAGATGGAGTTAGTACGATGTTTGAACGGATCACGAATATACTTATCAACCCACACGATAGATTACCGCATCGTATATGCTGGGAGTGTAGCGTGCGGTTGAAGAACGCTGATAGCTTCCGGGCTAAAGCTATCAGATGCGATGAGCTGCTTAACAAATTGGATTTGGACAGAACTATAACATTAAGAGATGTAAAATCAATAAATAGGACATTCAAAAAATTTAAAACTACCCTAATACAGAAAATATATGAGACAAACGAATATGATTATGAAATTAATGACGAGATCAAAACAGAAGAAAATAATGACACAGACTTTAATGATTATAATGATAATGATATTAATGATATCAAGAACGAACCCGAGATAAAAGATGAGAGTAACAATGAGATATTCTTCGAGGAGTATCCGGTTTTAAACGACAGCGACGAGATAACACTGAGCGAAGTCTGCAAGAAGAAAAAGGTGAGGAAGAAGAAAGTTAAAATAGAGAAGAGAAAAGAAGTTAAAATTAAAAAGAACGATGAAAACGACAGCCCGTCCATTACAATGGATAAGTACAAAAAATCCGATGTTAAACGTCGCAAAACCGATAACTTGGACGAATCGTTGTTCACTATAACGACTCTCACTTACGACGAGCAGATCGCTGAGATAGAAAAGCGACAGGAATCGGCGTCGTATAGACACTCGCCGTACAAATGTCTCACTTGTTACAGAGGATTCCTCATCAGAGATAGATATGAAGCACATGTCGTGAGACACAGCGAGCAAAGTGGTGCCTACGAGTGCTTTATATGCAAGACACGCCTGAAGACTGCGAGAGCTCTAAGAAAACATCTGACGGCGCAACACACAGAGAAATTTAACTGCAAGGGATGTCCGTTTGTTACCAGGAACAGGGGTGTAGCTCGTGAACACGAGAAATGGCACGCCGGCACCAAGTACCAGTGTCCACACTGTCCAAGCGAGTTTGATAAACTGACCACCTACATGGGCCACATCCGCATAAAGCACGTGTCGGACTTCGTGTGCGAGTTGTGTGGGTACACGTTTGTTAGTAAGAAGGGGGTAGACGTTCACAAGAAGAAAAAACATAGAGTTGTCGACAAAAACGTGGAACTGACTGGTCCGTTCTGTGAGGTATGCGACGTCCGCTTCCTCTCCGAGGAGGCGCACTCGAGACATCTGAAGTTGTCATCCAAACATAGCAGCGACAATGACCCTAATCGTATCCGGAACGACTCTCAAAGTATGAGCAGTGAGAGAGGACTCGCGAGAAGGAGGGACGTCAGAGAAACTGGTGATGCCTCACCTGTTACGTGCGAACAGTGCGGTCTTCAGCTGCGTGACCTGCGTCTATACGCGCAGCACTTCCGTCGGTCACACCCGGATAAGAACCGTACCAAGTACCCGGCTATGAAGACGCCGGCCATGTGCGAACACTGCGGCCGGATATTCCAGAGTATGGCGCTGTTGAAAGATCACATGTGGGTCCACACTGGGGAGAAACGGTTCAAGTGTGACAGGTGTGACAAGAGCTTCACCCAGAAGACAAACCTGGTCTTCCACATGAGGATCCACACAGGCCTGAAGCCCTTCAAGTGTGACACCTGCGGGAAGAGCTTCACCACTGCCGGTGAACAGCGAGCCCACACAGACCATGTACATCTCAAGAAACCCTGGCCGAAGAGGGCGAGGAGCGGCCAGTGGAAGTGTGTAGAGGATTAG

Protein sequence:

>DPOGS205427-PA
MITENCVSCMPLAFVRRFPKEDILRTAWIDALGMTEWESKDPSMVCSEHFTEDSFYVTKCGVKKVKADAVPLPVFITSDELDSAADKRVCRVCLSIDVKMYDITEDGVSTMFERITNILINPHDRLPHRICWECSVRLKNADSFRAKAIRCDELLNKLDLDRTITLRDVKSINRTFKKFKTTLIQKIYETNEYDYEINDEIKTEENNDTDFNDYNDNDINDIKNEPEIKDESNNEIFFEEYPVLNDSDEITLSEVCKKKKVRKKKVKIEKRKEVKIKKNDENDSPSITMDKYKKSDVKRRKTDNLDESLFTITTLTYDEQIAEIEKRQESASYRHSPYKCLTCYRGFLIRDRYEAHVVRHSEQSGAYECFICKTRLKTARALRKHLTAQHTEKFNCKGCPFVTRNRGVAREHEKWHAGTKYQCPHCPSEFDKLTTYMGHIRIKHVSDFVCELCGYTFVSKKGVDVHKKKKHRVVDKNVELTGPFCEVCDVRFLSEEAHSRHLKLSSKHSSDNDPNRIRNDSQSMSSERGLARRRDVRETGDASPVTCEQCGLQLRDLRLYAQHFRRSHPDKNRTKYPAMKTPAMCEHCGRIFQSMALLKDHMWVHTGEKRFKCDRCDKSFTQKTNLVFHMRIHTGLKPFKCDTCGKSFTTAGEQRAHTDHVHLKKPWPKRARSGQWKCVED-