Monarch geneset OGS2.0

DPOGS201360
Transcript: DPOGS201360-TA, 1557 bp
Protein: DPOGS201360-PA, 518 aa
Genomic position: DPSCF300083 - 394594-396876
RNAseq coverage: 5x (Rank: top 88%)
Annotation
Heliconius | HMEL014701 | 1e-69 | 78.62%
Bombyx | BGIBMGA000604-TA | 1e-71 | 74.53%
Drosophila | scrt-PA | 4e-66 | 73.33%
EBI UniRef50 | UniRef50_A7URZ8 | 1e-65 | 71.71% | AGAP006794-PA n=18 Tax=Eumetazoa RepID=A7URZ8_ANOGA
NCBI RefSeq | XP_001659830.1 | 9e-68 | 42.82% | hypothetical protein AaeL_AAEL009230 [Aedes aegypti]
NCBI nr blastp | gi|157121111 | 2e-66 | 42.82% | hypothetical protein AaeL_AAEL009230 [Aedes aegypti]
NCBI nr blastx | gi|157104747 | 2e-70 | 45.05% | hypothetical protein AaeL_AAEL014359 [Aedes aegypti]
Group
Gene Ontology | GO:0003676 | 1.4e-16 | nucleic acid binding
Gene Ontology | GO:0008270 | 3.3e-05 | zinc ion binding
Gene Ontology | GO:0005622 | 3.3e-05 | intracellular
KEGG pathway 
InterPro domain | [305-333] IPR013087 | 1.4e-16 | Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201360-TA
ATGAACCAACCAGAATCATCAGCAGTCAACTATGAGGAGATAGGAAGCACACCCATTACTGAACGAACTGCTGAAGAGACAGAGGCGGCTCATGAATTATTGTCTCTCGCGCACAGCTTGCCACCTCTGCCGCCAGTCCCACCCCTAACACCAGCAACAACCGTGCCGGCGTTGCCGCAGTTCCCATCGCTGCCATCTCTGCCACAGGTGACGCAGCTGCCTCCGCTGCCGTCTCTGTCGTCACTTCCATCAATATCACCACTGTCATCGCTGTCAGGACTGCCTTCTGTCCCTCTCGTCTCCGTGCTACCACCGAATGAACCTGTCGTCCCAATTTACACCTATACCATACATCCTACTAATATATATATAATAGCTGAAGAGTCACGTGACCCCACGTATAATAACTCTGTGCCAACAATAACACCGATTCCTTGTGGCATCGAATATACAATTGAACCTCAATTAAGCTATCTTGCTTATCAACATGTACCGGAGGCGGTCATGCCAGTGGGTCATATATTGCTCCCTGCAGCCGAAATAATACCTAATAATAATCGACCAGAACCAAGCAGAGTCACTGATGTAGTGGAGCCACCAAGCAGGCTCCCTTGTTTAATGGAGCCCAAACGTCCAAAAATCAAACCGATTAATGCTCCAAGAGGGAAAAATGCTAAATATGACTGCAAAGAATGTGGCAAGCGGTACGCTACATCTTCAAACCTGTCACGCCATAAGCAAACACATCGCAGCCTTGATTCAGTTGCAGCGAAACACTGCGAGGATTGTGGCAAAGTATATGTGTCAATGCCAGCACTTGCGATGCACGTTCTGACTCATAGAATGGGCCACGTTTGCGGTATATGTGGTAAACAATTCTCAAGACCCTGGCTTTTGCGTGGTCACTTACGTTCCCACACGGGAGAGAAGCCTTATGACTGTCCATACGAAGGATGCCCTAAGGCGTTTGCGGATCGATCAAATTTACGTGCACATCTGCAGACTCATACAGGTGACAAAAAATTTGAATGCTCAAAGTGCCACAAAACTTTTGCTCTAAAAAGTTACTTGGCCAAGCATGAGGAAACGGTGTGCTTTCGAGATGAGATTGCTTGTGATCCGGAGTTAATTCAACCTACAATACCTGAAACATCGAGTATACAGTCTGACAAGCCATCAGAGCAGCTTAATACTCCCTCAGTGGGATTTGAACAGCCTGAGACGTCTCATAACCAGCTAAACTCTCCCCGTATTCAGGCTCCGCTTGCAGATGAAGCTCGAATAGACTCAGATTGTATTCAGCTTGAGACTAGAGAAATTAGTGATCAAATTATGTATACTGATTCTCAGACTGAAGGTATCCAGGCGGAAACAGCAGTGAGACGGTATGAGCCTCTACGTCCCATCGGTATACGATCTGATTTGGAACCTTTGTCACTGGAATTAGAAGATCCGCCGCAACCAGCAGTGATACGTTTTGATCCGTCTTGCGTTTTGCCCGAGCCAGAAATTATACGATACGATACGATGAATCTGGTGCCTGTGTTCGCGGAATAA

Protein sequence:

>DPOGS201360-PA
MNQPESSAVNYEEIGSTPITERTAEETEAAHELLSLAHSLPPLPPVPPLTPATTVPALPQFPSLPSLPQVTQLPPLPSLSSLPSISPLSSLSGLPSVPLVSVLPPNEPVVPIYTYTIHPTNIYIIAEESRDPTYNNSVPTITPIPCGIEYTIEPQLSYLAYQHVPEAVMPVGHILLPAAEIIPNNNRPEPSRVTDVVEPPSRLPCLMEPKRPKIKPINAPRGKNAKYDCKECGKRYATSSNLSRHKQTHRSLDSVAAKHCEDCGKVYVSMPALAMHVLTHRMGHVCGICGKQFSRPWLLRGHLRSHTGEKPYDCPYEGCPKAFADRSNLRAHLQTHTGDKKFECSKCHKTFALKSYLAKHEETVCFRDEIACDPELIQPTIPETSSIQSDKPSEQLNTPSVGFEQPETSHNQLNSPRIQAPLADEARIDSDCIQLETREISDQIMYTDSQTEGIQAETAVRRYEPLRPIGIRSDLEPLSLELEDPPQPAVIRFDPSCVLPEPEIIRYDTMNLVPVFAE-
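
The transcript and protein records above can be cross-checked by translating the coding sequence. The following is a minimal sketch, assuming the standard genetic code applies and that the transcript is a complete in-frame CDS; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the geneset tooling. The stop codon is written as "-" to match the trailing character of the protein record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence into a protein string.

    Stop codons are rendered as `stop_symbol`; codons containing
    characters outside A/C/G/T are rendered as 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons of DPOGS201360-TA.
print(translate("ATGAACCAACCAGAATCA"))  # MNQPES
# Last five codons, including the TAA stop.
print(translate("GTGTTCGCGGAATAA"))     # VFAE-
```

Under these assumptions, a 1557 bp transcript gives 519 codons, that is, 518 residues plus the stop, which agrees with the listed protein length of 518 aa.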