Monarch geneset OGS2.0

DPOGS204915
Transcript: DPOGS204915-TA; 1371 bp
Protein: DPOGS204915-PA; 456 aa
Genomic position: DPSCF300340 + 26087-28319
RNAseq coverage: 42x (Rank: top 72%)
Annotation
Heliconius: HMEL010824; 1e-126; 55.97%
Bombyx: BGIBMGA001690-TA; 1e-112; 49.10%
Drosophila: CG15436-PA; 1e-08; 22.73%
EBI UniRef50: UniRef50_E0VYX9; 6e-15; 26.81%; Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VYX9_PEDHC
NCBI RefSeq: XP_002431323.1; 1e-15; 26.81%; hypothetical protein Phum_PHUM521410 [Pediculus humanus corporis]
NCBI nr blastp: gi|242021784; 2e-14; 26.81%; hypothetical protein Phum_PHUM521410 [Pediculus humanus corporis]
NCBI nr blastx: gi|242021784; 2e-18; 26.81%; hypothetical protein Phum_PHUM521410 [Pediculus humanus corporis]
Group
Gene Ontology: GO:0005634; 1.2e-07; nucleus
Gene Ontology: GO:0008270; 1.2e-07; zinc ion binding
KEGG pathway:
InterPro domain: [15-82] IPR012934; 1.2e-07; Zinc finger, AD-type
Orthology group: MCL26580; Lepidoptera specific

Nucleotide sequence:

>DPOGS204915-TA
ATGGCTACATTAATGGATTTGCCTTTAAACGATGTATCATTAGTATGTCGCAGTTGTCTAGCCGCTTCTGGTGATATGAAAAATATGACGGAATGGGGATACGCTGAAGATTTTTATAAATTAACCAACATACAGATAAATCGATTGGATAATATAACGGAACTGCTCTGCACCAGTTGCGAGGGTCTGGTTCTGAACTACAGGAGTTTTGTGCAACAATGTCGCCAGTCCGATCGCTTGCTGAAGGATATGAGGCAAAAATCACTTACACCGAACACCAAAAAGGACGTAGAAGAGAACTCAATAGCCTTGGAAATAAATGAAAATGAATTGACATTTAAAATAACAGCCCCGGACATGGAGTCAAAACTATACTTACCATGCAACCAGTGTCACGACACATTTGTGAAGAAAAACGACTTGATCAGCCACATGACGAAGAAACACAATAATCATGACAATATAATAATAAATATGAAGTTTTTTTGTTCGTTTCCCGATTGTCCCTACAACGTTCTATCCGGTAAAGACAAACACTTTTCCGGCAGGAAGTATCTCAATCAGCACATTTTCAAAGTCCACAAGGCTAAAAACTTGTTATGTCACAATTGTGACCTAACATTCAGCAGCGATCAGGACTTCAGACGACACCTGAAGACCTGCAACTACATCTACATCTGCAAAGTCTGCGACATACAGTATAAGACGAACGAAAAGCTGCTTGTACACTTACTGAGGAAACATCCAGATTTGCACAAACAGTACAAGATTGAGCGGAAAGCGGAGAAGCGGAAGATTGAAATTGAAGAACCAAAGAAGTTAAGGGCCCACAACGAAAATCCCGAACTGACATGCGATAGTCCCAAGCGGTCATTCGCGACACAGACGTTCGAATTGAAGAGGCAGATAAAAAACGATGTGACGTTGCCCTCGTGGCTGGCCGACAAACAGGATGAAACCAAAAAAGACGAAATATCCACTCAAACTGTGTTCGAAGACATCCTGTCTGTTAAATCACAGAACAGTGAGGACGATCTTTTCTTCTCGGAGACGGTATCACTATCTGATATCCAGACTCAGACGATTCCCTTGGAATTTGGACTCAGTAGATCAAACAAGCAAACGATCACCTCAGAGACTCAGTCCCCAGATCTGAGCATGAAGGAAACTCAAACCTGCCTCTGCTTATATGAAACTCACAAACCGAATAGATGCCTTGAGGGCATTCCGTCAAATTCCAATAGCAATTTCTGTACACTCACGAGCACTGAAACACAGACTCATAGATTTCATAGTCAAGATAGTGATTCGTTGATGAGTTTCACTTCCACCGAAACTCAAACCTGCTTCGACGATCTGAACAAGTTATGA

Protein sequence:

>DPOGS204915-PA
MATLMDLPLNDVSLVCRSCLAASGDMKNMTEWGYAEDFYKLTNIQINRLDNITELLCTSCEGLVLNYRSFVQQCRQSDRLLKDMRQKSLTPNTKKDVEENSIALEINENELTFKITAPDMESKLYLPCNQCHDTFVKKNDLISHMTKKHNNHDNIIINMKFFCSFPDCPYNVLSGKDKHFSGRKYLNQHIFKVHKAKNLLCHNCDLTFSSDQDFRRHLKTCNYIYICKVCDIQYKTNEKLLVHLLRKHPDLHKQYKIERKAEKRKIEIEEPKKLRAHNENPELTCDSPKRSFATQTFELKRQIKNDVTLPSWLADKQDETKKDEISTQTVFEDILSVKSQNSEDDLFFSETVSLSDIQTQTIPLEFGLSRSNKQTITSETQSPDLSMKETQTCLCLYETHKPNRCLEGIPSNSNSNFCTLTSTETQTHRFHSQDSDSLMSFTSTETQTCFDDLNKL-
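
The transcript and protein records above can be cross-checked against each other: 456 codons plus one stop codon give 1371 bp. The following is a minimal sketch of such a check, assuming the standard genetic code applies to this nuclear gene and that the trailing "-" in the protein record marks the stop codon. The function name `translate_cds` and the `stop_symbol` parameter are hypothetical and are not part of any MonarchBase tooling; only short excerpts from the start and end of DPOGS204915-TA are used.

```python
# Build the standard genetic code table (NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate_cds(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` so that the output follows
    the convention assumed for the protein record (trailing "-").
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 12 nt of DPOGS204915-TA, copied from the record.
first_codons = "ATGGCTACATTAATGGATTTGCCTTTAAAC"
last_codons = "AACAAGTTATGA"

print(translate_cds(first_codons))
print(translate_cds(last_codons))

# Length consistency between the two records: 456 aa plus one stop codon.
print(456 * 3 + 3)
```

Passing the full nucleotide sequence to the same function and comparing the result with the DPOGS204915-PA entry would extend the check to the whole record.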