Monarch geneset OGS2.0

DPOGS203101
TranscriptDPOGS203101-TA1206 bp
ProteinDPOGS203101-PA401 aa
Genomic positionDPSCF300391 + 28166-31495
RNAseq coverage55x (Rank: top 69%)
Annotation
HeliconiusHMEL0142451e-4341.20% 
BombyxBGIBMGA001685-TA6e-3536.96% 
DrosophilaCG6654-PA2e-3030.96% 
EBI UniRef50UniRef50_F1N7M43e-3436.09%Uncharacterized protein n=4 Tax=Bos taurus RepID=F1N7M4_BOVIN
NCBI RefSeqXP_001948812.12e-3838.70%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum]
NCBI nr blastpgi|3287101094e-3738.70%PREDICTED: zinc finger protein 135-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287101091e-4139.57%PREDICTED: zinc finger protein 135-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00036766.3e-10nucleic acid binding
GO:00056341e-08nucleus
GO:00082701e-08zinc ion binding
GO:00056227.9e-05intracellular
KEGG pathway 
InterPro domain[278-310] IPR0130876.3e-10Zinc finger, C2H2-type/integrase, DNA-binding
[10-84] IPR0129341e-08Zinc finger, AD-type
Orthology groupMCL23630 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203101-TA
ATGTCTGAAATATCGGAGTTTAAGGCTGTGTGTCGTTGTTGTCATACTTATGATATGCTTAAAGATATTACCAGATTATATGAAAATAATCATATTACGGAAATGTATTCAGAAATGCTAAACGAGACATTCGGTTTAACACTCCGGACTCCACCTGTAACTGTTCAGTATACAATATGTGACACCTGTATTATTCAGCTGGAAGCTGCAAATCAGTTCAAGAAACAAGTGCAGAAATGTGAAGCCAAGTTTATAGAGTGTTGTCAAAAGAACAACTTCAATTACAAGGTAAATGAATTGGAGACAGTGAAACATGAACCGGGGGAATGCTGTGAGAAAAATGAAGTTTCCAGAAAGAACAACTTCAATTACAAGGTAAATGAATTGGAGACGGTGAAACATGAACCGGGGGAATGCTGTGAGAAAAATGAAGTTTCCGGAAAGTCAAATATATCTTATAAGGAACGTAAGAAAGATAAAAGAAACACAAAAATAAAGGATCACCAATGCTTTGAGTGTCAGATATGTAAAAAAAGTTTTATATCAATGCAAAAGATGTTGAACCACAGGCACAGACAGCATCAAAAGAAATCTGTCTTCATTTGTGACATGTGTCACAAGGAGTTTCTACATAAGAACTCCTTGTTAAAACACATAGGATGGCACATGGGGATAAATAAAAGATTTATATGCGAGATATGTGGATATTCATTTCACGATAAGACAAATTTAAACGTTCATTTACAGGCCGTCCATCAGAAACTGAAGTTATACACGTGTACACTGTGTCCAAAAAAATTCGCTGCGAATAAGAATTTAAAAATACATTTCCGTCTTCACAGCGGCGAACGGCCGTATAAATGTGATGTTTGCGATGAAGGTTTTATATGTTCGACCTATCTGGTGAAGCATAAACAAAAACATGACAATGTCAAAAAATTTGGCGGGAAATACGTTTGTAAGGTTTGCAGTACTGTTTTCAACGAGCGGCATGTTTTCACGGCGCATATGCGTTCTCACGTCGGTGTTAAGCCGTACAGATGTAGTTACTGCGAAAAGGATTTCTTTACACGTTTCTCTCTCAAACGACACAACGAAAATCAACATAACGAAAACATCGTATGTAAAAAATGCGACGCCATTTTTTCTGATAAAACAAATTTACAGCGGCATGTCAAAATGCACGGTAAAGAGAAAAAGTCGTGA

Protein sequence:

>DPOGS203101-PA
MSEISEFKAVCRCCHTYDMLKDITRLYENNHITEMYSEMLNETFGLTLRTPPVTVQYTICDTCIIQLEAANQFKKQVQKCEAKFIECCQKNNFNYKVNELETVKHEPGECCEKNEVSRKNNFNYKVNELETVKHEPGECCEKNEVSGKSNISYKERKKDKRNTKIKDHQCFECQICKKSFISMQKMLNHRHRQHQKKSVFICDMCHKEFLHKNSLLKHIGWHMGINKRFICEICGYSFHDKTNLNVHLQAVHQKLKLYTCTLCPKKFAANKNLKIHFRLHSGERPYKCDVCDEGFICSTYLVKHKQKHDNVKKFGGKYVCKVCSTVFNERHVFTAHMRSHVGVKPYRCSYCEKDFFTRFSLKRHNENQHNENIVCKKCDAIFSDKTNLQRHVKMHGKEKKS-