Monarch geneset OGS2.0

DPOGS204724
TranscriptDPOGS204724-TA1536 bp
ProteinDPOGS204724-PA511 aa
Genomic positionDPSCF300257 + 162382-163917
RNAseq coverage33x (Rank: top 75%)
Annotation
HeliconiusHMEL0117070.074.67% 
BombyxBGIBMGA008250-TA1e-12246.88% 
Drosophilacrol-PE3e-4832.66% 
EBI UniRef50UniRef50_D2A2071e-5832.67%Putative uncharacterized protein GLEAN_08440 n=2 Tax=Tribolium castaneum RepID=D2A207_TRICA
NCBI RefSeqXP_001944018.18e-5840.58%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastpgi|3443073796e-6142.21%PREDICTED: zinc finger protein 568 [Loxodonta africana]
NCBI nr blastxgi|3443073791e-6842.21%PREDICTED: zinc finger protein 568 [Loxodonta africana]
Group
Gene OntologyGO:00056346.1e-14nucleus
GO:00082706.1e-14zinc ion binding
GO:00036766.7e-13nucleic acid binding
GO:00056228.4e-05intracellular
KEGG pathway 
InterPro domain[5-76] IPR0129346.1e-14Zinc finger, AD-type
[264-291] IPR0130876.7e-13Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL25451 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204724-TA
ATGGACTCATTGATATGTAGGATTTGTCTGGACAAGACCGGCACCATATCTATATTCGATTGTGAAACCGATAATATGCAATACAGCAGTAAAATAATGCAAATAGTCAATATCATTATAAATGAGGATGATGGCTTACCTAGTATGATTTGTGAGGGTTGTGCCGGAGACTTATCCCTATCCTATCAATTCGTGCAGAAATGCCAAGCGTCGGATAAGGCACTGCGATGTTTGAGCACGCCAATAGAACTATACTCGGATCTTCAAGTGGATATCATGAATATTAAAGAAGAGGATATCAAAAACGAAACTGAAGACGGTGACAACGAATTCGATGAAACCTTCCTGTTGGAAAATTGTTCGAAACAATATTTCGAGAAGCGACGGTATAATGACGGAGATGACGTCGATTTGAGAAAGGAGGAAGAAAAACATAAGAGTCCACAAAAATATAATTCAGATACAAAAGCATTTTTTGGAGATTGTAGAAATGATAAAAGTATTAAAGACAAAAATCGTAGAAAAAAATTGGGTCGAGGGAAAATGGGACCTATACAGTGTGTTATTTGTGGCCTGATGGCATCCAGTCCATCAGCTATGGAGATACATATGAGGACCCATACTGGAGAAAAACCTTTCATGTGTATGGCATGTAATACTAAATTCCCCACTAAAGGATCCTTAAAGAGACACAATGAAACCTACCACTTAGCTAGGGAGAGAAAATTTACCTGTGAGACCTGTGGCAGCAGCTTTTTTAGGAAAAATGATATAATAACACATATGAGAGTTCACACAGATGAGAGACCATATGTTTGTCAATACTGTTCTAAAAGGTTCCGACAAGTTGCCTCACGCAATAGACATCAGATTGTTCACACTGGAGAGAAACCATATGCATGTCCCATATGTGATAAGAAATTTGCTCATAAAAGCCTTGTCACCAAACACCAAAGTGTTCATAGTGATGAGAAAAAATACACTTGCCACCTATGTAGCAAGTCCATGAAGTCCCGAACAGCTTTAAATGTCCACATAGGGCTACACACTAACGAGAAACAGAATGTGTGTAGTTTCTGTGGAATGGCTTTCGCCATGAAGGGTAACTTACAGACGCATATCAGAAGAATACACTCTGAGAAATCTGGTCAGTGTTCCACTTGTTTAAAGACATTTTCAGATTTACAAGTGCATATGAGAAAGCATACAGGTGAAAAGCCATACATATGTGGAACTTGCAATGCTGCCTTCTCTGTTAAAAGGAGTTTAGCCCACCACATGATGTTCAAGCATGAAAACGCCGGAAAATACAAATGTTCTATTGGTGACTGCTCCAAAACCTTCCCTACGGCGACAATGCTCGAGTTCCACCTAATGAAGCAGCACACTAACCATACGCCCTATACCTGTCAACATTGCCCACGAAGGTTCTTCAGAACTAGTGATCTCTCCCGTCACCTACGAGCTAGTCATATGGACATACAATTTAAGCCTCCGCTCAAAACTCTCCTACCCAAAACTATGTCTTTCGCTTAA

Protein sequence:

>DPOGS204724-PA
MDSLICRICLDKTGTISIFDCETDNMQYSSKIMQIVNIIINEDDGLPSMICEGCAGDLSLSYQFVQKCQASDKALRCLSTPIELYSDLQVDIMNIKEEDIKNETEDGDNEFDETFLLENCSKQYFEKRRYNDGDDVDLRKEEEKHKSPQKYNSDTKAFFGDCRNDKSIKDKNRRKKLGRGKMGPIQCVICGLMASSPSAMEIHMRTHTGEKPFMCMACNTKFPTKGSLKRHNETYHLARERKFTCETCGSSFFRKNDIITHMRVHTDERPYVCQYCSKRFRQVASRNRHQIVHTGEKPYACPICDKKFAHKSLVTKHQSVHSDEKKYTCHLCSKSMKSRTALNVHIGLHTNEKQNVCSFCGMAFAMKGNLQTHIRRIHSEKSGQCSTCLKTFSDLQVHMRKHTGEKPYICGTCNAAFSVKRSLAHHMMFKHENAGKYKCSIGDCSKTFPTATMLEFHLMKQHTNHTPYTCQHCPRRFFRTSDLSRHLRASHMDIQFKPPLKTLLPKTMSFA-