Monarch geneset OGS2.0

DPOGS203091
TranscriptDPOGS203091-TA999 bp
ProteinDPOGS203091-PA332 aa
Genomic positionDPSCF300391 - 202023-203021
RNAseq coverage0x (Rank: top 96%)
Annotation
HeliconiusHMEL0115708e-2630.14% 
BombyxBGIBMGA011392-TA4e-16180.72% 
DrosophilaCG12299-PA3e-2226.24% 
EBI UniRef50UniRef50_F1N7M44e-2931.50%Uncharacterized protein n=4 Tax=Bos taurus RepID=F1N7M4_BOVIN
NCBI RefSeqXP_001946669.11e-3131.88%PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp), partial [Acyrthosiphon pisum]
NCBI nr blastpgi|3287264185e-3131.72%PREDICTED: zinc finger protein 271-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3545009921e-3632.27%PREDICTED: zinc finger protein 420-like [Cricetulus griseus]
Group
Gene OntologyGO:00036762.1e-07nucleic acid binding
KEGG pathway 
InterPro domain[190-217] IPR0130872.1e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL23305 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203091-TA
ATGGAGACAAAGTCTGTCGTAGGAATTAATGACCATTGGATGAAAAGTATGATGGAGCCGGCCGGCAGCGACAAACAAATACAGAACGACAATAGAAATTCTGTGAAGCGCGAAAAAATGGCAGAAGCGTACAACGGTCAAGACAACAATATAACGTACGAGTTAAAACAACAACAAGGCGAAGGGAGGTATCGTTGTGACGTGTGTGACCTCACCTTCGAGGGTATGGAGTTCTTGATGTTGCATAAAAAATTTCACGGAAACGACAAAGACGCGCCGGTGATGGAACCCGAGCCGGTAGTCGTTAAAGAGGGCCCGAAGAAGCGGAGGAAGTTCAAATGCGATCAGTGTGACGTCAAGGTCAACTCACAATACCATCTTGACATACACCACAGCAAGCACGAAGGTAATGCCTCCAAGAAATATATATGCCAAGTCTGTGACCGGAGCTTCGTTGCGGCCCAGTTCCTGAAGAGCCACATGCAATCCCACTCGCTAACAAGCACGATAAACTGCGAAATATGTAACAAGAGCTTCACCGGACAAGCATATCTCAAACTGCATATGCAACTACACAATGACATCAAGAAATTTAAATGTTCGAAATGCGACGAAATGTTCCCAACCAAGAACTGCCTTAAAATCCATATGAAAAAGCATACGAAGGTCAAAAATTTCAAATGCGACTGCTGCCATAAGCTGTTTGTATCAAAGATTACTATGGAGAAGCATCGTCAGAGCCACGAGGGGCAACCCATGGACTGTCATGTGAAATGTACTCAATGTGACAGGACTATGCATTCATCGTTACTAAGAACCCATATCGCTAGAGCTCATCCGTCAAGCCACGACGATGATGTCAAGAGACCATACAAGTGCTCGACCTGCGGCAAGAGTTTTATCGTAAAAATTAATCTAAATCTACACATACGGATGTGTCATCCCGACCTCGCCGAGCCAGAGGATGAGGACGATGATGTCATGACAGACAATTCCTAA

Protein sequence:

>DPOGS203091-PA
METKSVVGINDHWMKSMMEPAGSDKQIQNDNRNSVKREKMAEAYNGQDNNITYELKQQQGEGRYRCDVCDLTFEGMEFLMLHKKFHGNDKDAPVMEPEPVVVKEGPKKRRKFKCDQCDVKVNSQYHLDIHHSKHEGNASKKYICQVCDRSFVAAQFLKSHMQSHSLTSTINCEICNKSFTGQAYLKLHMQLHNDIKKFKCSKCDEMFPTKNCLKIHMKKHTKVKNFKCDCCHKLFVSKITMEKHRQSHEGQPMDCHVKCTQCDRTMHSSLLRTHIARAHPSSHDDDVKRPYKCSTCGKSFIVKINLNLHIRMCHPDLAEPEDEDDDVMTDNS-