Monarch geneset OGS2.0

DPOGS200484
TranscriptDPOGS200484-TA2112 bp
ProteinDPOGS200484-PA703 aa
Genomic positionDPSCF300158 - 139998-149892
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0043870.076.80% 
BombyxBGIBMGA010420-TA3e-14176.61% 
DrosophilaCG12071-PB2e-7952.75% 
EBI UniRef50UniRef50_Q86PF23e-7752.75%CG12071, isoform B n=13 Tax=Drosophila RepID=Q86PF2_DROME
NCBI RefSeqXP_002136900.14e-7849.86%GA26919 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1984494597e-7749.86%GA26919 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|1984494591e-11843.04%GA26919 [Drosophila pseudoobscura pseudoobscura]
Group
Gene OntologyGO:00036761e-12nucleic acid binding
GO:00056349.3e-12nucleus
GO:00082709.3e-12zinc ion binding
KEGG pathway 
InterPro domain[410-439] IPR0130871e-12Zinc finger, C2H2-type/integrase, DNA-binding
[30-104] IPR0129349.3e-12Zinc finger, AD-type
Orthology groupMCL18359 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200484-TA
ATGGACAACTCCTGCCTCCTCGACCTCCTCATCAGGTGCTTCGCCATGACCGAGCCCGAGGACCTCAACTTCACGGCCTTCACGGACCTCTGTCGACTGTGCTCCCTCAAGAGCGGACCCAGGCTGCACATATTCGACAAGGAATCCGAGCAGAGGCAGATCCTTTTCAAAATGAGGACCTGCTTACCGACTATGCAGATATCGAAAGACGACTTCCTGCCCAAGAAGATCTGCGAGCGCTGCGTCACGAGATTGGAACAATTGTATGAATGGAGGCAGAGCTGCCTCAACACGGACTCGGTGTTGAGGAACTACGCGGAGTCTATGAGGATTGTCACTTCTACTATAAACTTTCAGCACAGACGTGATAAGGCCCTACGAATTCGACCCTCTATCGGTAATCTTCGAGGGGGCCTCCCTAAACAAGCTGACGCAAATTCAGATCACTCAAGTGTTCCTATTAGCGTAGTGTCGCTGATAGTGAAGGACGGCACTGTTAACATGGACAAAATGACGGAAGCACAAAAAACGGCATACTTAGAGGCACACGTAGCGGTACAACAACATATGGCTCAGGCCGCAGCTGCGGCTAGACGAGAACAACAACAACAACAGCAGCAACATCAGCAACAGCAACAACAACAGCAGCAGCAGCAGCAGCAACATCAACAGCAGCAACAACACCAACAACACCAGCAACAACAACAGCAACAGCAGCAGCAGCAACAACAACAACAACAACAATCGAATAATTCTAATAACAACTCACGTCATCACCAAGACATGCAACAGGCGCAGAGTAATACACACCCTGGTCACCCATCGCCACCGACCGTCAGTTCTACACAGCATCACTACGCTACCATCAACTCCCCCATCAAGGTACCTCAAGTCAGTTCCAGTACTGGAGGTCCAGATTCAACGTTCACAGTACCAGACGACGTTGGCATGGGATTTGAAGGCGGTGTTCGTGTTCTACAAAGCCTCGGAAATTGGTCACCAGAAGTTACCAATACGATACCTAGGCCGAATTTAATACCGTTCACTGAACCGTACGCGGAAGGCGGATCTATGCATCCAGGGACTCGTCTGAAAGCTCTCCAAGGAACCGCAATCCGAAAACATCCTGCCATTAAACCTACCAGCTCTACGAATGAAGGAAGCAAGGCATTTGAATGCACTGTCTGCGGTAAGGGATTAGCGCGCAAAGACAAACTAACAATACATATGAGGATACATACTGGTGAGAAGCCATACGTCTGTGAAGTTTGTAATAAAGCGTTTGCAAGAAGGGATAAGTTAGTTCTTCATATGAACAAATTGAAGCACATCACTCCATCAAATATAGCGCCATTGGGTAAACGGACTATTACAATACCTCCTCTCAAGCTGGATGACGTTAAGCCTCAACTAACCAAGCAAGAAGATACTAAACCAAGTCAGGCCGAGATCGCAGCTGTTGCTCAGCAGCAGCAGCAACAACAACAGCAGCAGCAGCAGCACTCCCATCAGCAGTCGATTCAGGTCTGCCAGGTACCTGGACACAACTTTACGAATACTAATCTAGTTCACACAAGCGCTTCAAGTGGCTCCGTGTCAGCGGCCACAGCAGGGATGGTCGTGTCGTGGTCGTGCGAGCTCTGCGGAAGGTTATTTGCTACCAGAGACGAGTGGAGTGCACACGCCAAGTCCCACCTGCCAGACAATAAGCTCCTACAGGATAAAATGCAACAAGCTACACAGCATCAGCAACAGGAAAAGCTTCATTTAACGAATCAGGAAAAACTACAGCTACTTCATCATCACAATCAGAATCAGCAGATCTTGAATAACCACGGCATTGCTACCTCCGTATCATCTGCGACTGTTGTTAACGAAGGAGGAGGGGGTGGGGGAGGGGGAGCATACTTCTCCCACGGTCACACACACTACGCACCAGAGAGGCAACATACACACGCACATCACCTTTGCCTGATGTGCCGTCAAGAGTTCGCTGGTAAAGCAGAGTTTATGTTCCATGTTAGAGGACATTTCGAAGGTAAAGTGAGCGATATAGCAGCAGCAGATGTATTGGCACGATCGCTAGTGGATAATTCCGGTCTTTGCACCTGA

Protein sequence:

>DPOGS200484-PA
MDNSCLLDLLIRCFAMTEPEDLNFTAFTDLCRLCSLKSGPRLHIFDKESEQRQILFKMRTCLPTMQISKDDFLPKKICERCVTRLEQLYEWRQSCLNTDSVLRNYAESMRIVTSTINFQHRRDKALRIRPSIGNLRGGLPKQADANSDHSSVPISVVSLIVKDGTVNMDKMTEAQKTAYLEAHVAVQQHMAQAAAAARREQQQQQQQHQQQQQQQQQQQQQHQQQQQHQQHQQQQQQQQQQQQQQQQQSNNSNNNSRHHQDMQQAQSNTHPGHPSPPTVSSTQHHYATINSPIKVPQVSSSTGGPDSTFTVPDDVGMGFEGGVRVLQSLGNWSPEVTNTIPRPNLIPFTEPYAEGGSMHPGTRLKALQGTAIRKHPAIKPTSSTNEGSKAFECTVCGKGLARKDKLTIHMRIHTGEKPYVCEVCNKAFARRDKLVLHMNKLKHITPSNIAPLGKRTITIPPLKLDDVKPQLTKQEDTKPSQAEIAAVAQQQQQQQQQQQQHSHQQSIQVCQVPGHNFTNTNLVHTSASSGSVSAATAGMVVSWSCELCGRLFATRDEWSAHAKSHLPDNKLLQDKMQQATQHQQQEKLHLTNQEKLQLLHHHNQNQQILNNHGIATSVSSATVVNEGGGGGGGGAYFSHGHTHYAPERQHTHAHHLCLMCRQEFAGKAEFMFHVRGHFEGKVSDIAAADVLARSLVDNSGLCT-