Monarch geneset OGS2.0

DPOGS204382
TranscriptDPOGS204382-TA1725 bp
ProteinDPOGS204382-PA574 aa
Genomic positionDPSCF300002 - 1735548-1737350
RNAseq coverage166x (Rank: top 51%)
Annotation
HeliconiusHMEL0130750.081.69% 
BombyxBGIBMGA007089-TA3e-4131.28% 
Drosophilafu2-PA1e-5133.24% 
EBI UniRef50UniRef50_D6X4489e-10739.61%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X448_TRICA
NCBI RefSeqXP_972260.22e-10739.61%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastpgi|1892415993e-10639.61%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastxgi|2700008725e-11140.07%hypothetical protein TcasGA2_TC011130 [Tribolium castaneum]
Group
Gene OntologyGO:00056342.4e-15nucleus
GO:00082702.4e-15zinc ion binding
GO:00036766e-12nucleic acid binding
GO:00056227.3e-06intracellular
KEGG pathway 
InterPro domain[13-89] IPR0129342.4e-15Zinc finger, AD-type
[282-310] IPR0130876e-12Zinc finger, C2H2-type/integrase, DNA-binding
[263-285] IPR0070877.3e-06Zinc finger, C2H2
Orthology groupMCL18273 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204382-TA
ATGGATGATTCATTTATCGTGGCTAATTTTCATGAGTTATGTCGCCTCTGTTTACGCAAAAGTGAATTCACAATATCAATTTTTGGAGCTGTTCCTGACAATGAAGAAAATATATCTCTAACCTCGAAAATTGCTGAATGTCTCGAGCTGCAGATGGATCCCAATGATGGTCTCCCTACTAGGATCTGCTACAAATGCTTATTTAAGGTTAATAAATGCTCCAAATTTAGATTACAGTGTATTCAGAGTGAGGCCAGATTAAGACAAATAACGAATCGTGTGAATGAGTTAGATAATTCAAACTCATCAGAATTAAGTAATTATAATTTTAACAATACCCCAGAAACATTAAAACCAAAGGCTGAAGATTATATTGTTGAAGATAGTGTCGTGATGGTAGTAGATCCTAGCTTAGACTATGATTCTTCAGAGGAATCTGAGTATATAGATCAGACAGAAACGGAGACTTGTGAAAGAGATAACACACCTGACGGAGAAACAGCTTCGGAATCCTTTTATAAAAATGTATTTATGTGTCAATATTGTGATCAAGCATTTGTGTCTCAAGAAAAGTGCAAAGAACACGAGCAAAGTTTCCATGACCCAAATCTTCCTTATAAATGTGTGGAGTGCAGTCTAGTATTTTCGGAACGCAGTCAATTTGTTGCACACACTAGGCAAGTCCATGGTAATGATAAACCCTATCATTGCCCGGAATGTGACAAATGTTTTGGTAGACGCTCTGATTTAAGGAAACATTCTATAGTTCATACTGGTATAAGACCTTTTCAATGCCATTATTGTCTTAAGAGCTTCTCGAGGAATACAAACTTAAGTAAACATTTAAGAATACATGCAGGACATAAGCCTCATGTATGTCCTTTATGTCCGCGAAGCTTTGTAGCAAAAGGTGATTTACAGAGACATGTACTAGTTCACTCGGGTGTTAAACCTTATGCATGTAGGAAGTGTCCACTCACATTTGGACGAAGAGATAAGCTGATAAAGCATGAAGTCCGTCATGGACCTGTAAGTCCAGAAAATAAAGAATATGAAAATGATGATGCTCATGACATGGTTGTTAATGTAAACCCTTTTAGTAATTTAATGACATCCCCACCTCAGCACAATATTGAAAACACGAATGAATATGATTTGCCAAGGGTTCCGGACCATATTGCCGGTGATAGTACTTTTTTAAATCAAATTCAAAAGAATCCATCCACTTCTAGCAAACCTCAAACAAATTCACCACCTAAACCGAAAATGGCATCACCAAATAAAAATAAACCTAAGAATATAAAATGTCATCAGTGTCCAAAGAGGTTCTCCTCGTTAGATGCATACAAAACTCATGTGTCCATAGCACATATTGGATCTAGGATATTTCAGTGTAAGATATGCTTCAAAAAATTCCCTAGAAAAAGAGAATTTGATCGCCATGTAGCTTCTCATTCTGGTATGAAACCATTTAGTTGCAGTCAATGTGATAAAAAGTTTACGAGGAAAGATAAACTTAACAAGCATGAACAAACTCATGAATGTCTGGTTGTGAATATGCCTTGCATAGAATGTGGAGCAACATTTGAGAAAAAACCTGACCTAGTTGCACACATTAAGTCCCACTTTCCAGAAAATTATGATAACAAAATCCTTAATACTGAAATTAAGAAGGAAAATGTACCCGACTTCCCTTTGGACAATTTTTATGACTTGGAAACCTGA

Protein sequence:

>DPOGS204382-PA
MDDSFIVANFHELCRLCLRKSEFTISIFGAVPDNEENISLTSKIAECLELQMDPNDGLPTRICYKCLFKVNKCSKFRLQCIQSEARLRQITNRVNELDNSNSSELSNYNFNNTPETLKPKAEDYIVEDSVVMVVDPSLDYDSSEESEYIDQTETETCERDNTPDGETASESFYKNVFMCQYCDQAFVSQEKCKEHEQSFHDPNLPYKCVECSLVFSERSQFVAHTRQVHGNDKPYHCPECDKCFGRRSDLRKHSIVHTGIRPFQCHYCLKSFSRNTNLSKHLRIHAGHKPHVCPLCPRSFVAKGDLQRHVLVHSGVKPYACRKCPLTFGRRDKLIKHEVRHGPVSPENKEYENDDAHDMVVNVNPFSNLMTSPPQHNIENTNEYDLPRVPDHIAGDSTFLNQIQKNPSTSSKPQTNSPPKPKMASPNKNKPKNIKCHQCPKRFSSLDAYKTHVSIAHIGSRIFQCKICFKKFPRKREFDRHVASHSGMKPFSCSQCDKKFTRKDKLNKHEQTHECLVVNMPCIECGATFEKKPDLVAHIKSHFPENYDNKILNTEIKKENVPDFPLDNFYDLET-