Monarch geneset OGS2.0

DPOGS204954
TranscriptDPOGS204954-TA1176 bp
ProteinDPOGS204954-PA391 aa
Genomic positionDPSCF300160 + 548870-550198
RNAseq coverage6x (Rank: top 87%)
Annotation
Heliconius% 
BombyxBGIBMGA013607-TA3e-1824.93% 
Drosophila% 
EBI UniRef50UniRef50_UPI00020605798e-2828.74%UPI0002060579 related cluster n=1 Tax=unknown RepID=UPI0002060579
NCBI RefSeqXP_001599720.19e-1734.55%PREDICTED: similar to ENSANGP00000022132 [Nasonia vitripennis]
NCBI nr blastpgi|3286968663e-2728.74%PREDICTED: zinc finger BED domain-containing protein 4-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3286968661e-2728.21%PREDICTED: zinc finger BED domain-containing protein 4-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00036762.3e-32nucleic acid binding
GO:00469833.2e-11protein dimerization activity
KEGG pathway 
InterPro domain[1-310] IPR0123372.3e-32Ribonuclease H-like
[225-294] IPR0089063.2e-11HAT dimerisation
Orthology groupMCL14542 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204954-TA
ATGACTCACGTCTTAAACCTTGTAGCCGATGGAGTATTGAAAGAAATTAAAGAATTTTCAGCACTCACCGACCAGGTCAAAACCATTGTGACGTTCTTTAAGCAGTCAGTCAACTCTATGGACCAATTACGAGCCGAACAAGAATCATCTGGTAAAAAAGAAGGTGAGGTGTTGACATTGATACAAGCTGTCAGCACTAGATGGAACTCTTGCTTTGACATGCTAGAGAGATTCGTCAAACTGTCTGCACTAGTAGCTAAAATTTTAGCCACAAAAAGCCAAACTAGCAAGAATACACCAGATATCGTTCCATCGTCGCAACTTAATGTCATACGGGATTTTATTGCTCTTCTTGGCCCTTTCAAAGAAACTACTGAGGAAATAAGTGGTGCGAATTATGAAACGTCCAGCTTAGCCATTCCTCTTACAACTCTAGTTACCCAAGTAACAGATCAAGCAACTCCATCGACCTCTTTAGGGCTGATTGTAAAAGGCTTTACTCAGTTGTCGTGTCAAATGCTATATCCAAAATTAGTGCTGAGATTCGTGCAGAACATCGCCGTAGAGGACAACGTTCTCCGGATAAATATCCTGATCAAGCAGCATCAGAACAAGCTAGTGGGTCCTCTATCTGCTACGTCAGAGGTTCCAAGCAGTGGCTGTGTCCCCAATGAACTGAAGCAGTATTTGGACCAACCACTTTTAGATAGAAAATCTGATCCAATAAAGTTCTGGATAAAATGTCGTCATTTCACGCCGGTTCTCTCGGATATCGCTCTAAAATATATGATTTGCCAAGCTTCTTCAGTGTCATCTGAAAGGGTTGCTTCAGTGGTGAATTTGGCTGTACCGAATGAAAGGAGCAGATTAACAGGGGACCACATCAAACAAAGAGTGCTCCTGATATGTGTATTTGCTTTAAAATTTCGAATATTTTTCGGAGCCTACGCAATTGGTTCGTGGGACTCTAGGGGTGGCGCGGTCGTGTGGCCGTATGGCCTGGCCGGTGAGTGCGGGCCGGCCTTGTGGTCGTTAGACTCTGGGCTGGCCGGCGTGTGGGTCGGAGAGGGGCCCGAGGACCCCCATCCGTGGGCTAGGGCTCCTTCTCGGAATCTGTGCGAAGGGGCTGGCGTGCCGGTCTCGGTGTGTCGGTCATCTTGCTCCCCTGTCCTCTGA

Protein sequence:

>DPOGS204954-PA
MTHVLNLVADGVLKEIKEFSALTDQVKTIVTFFKQSVNSMDQLRAEQESSGKKEGEVLTLIQAVSTRWNSCFDMLERFVKLSALVAKILATKSQTSKNTPDIVPSSQLNVIRDFIALLGPFKETTEEISGANYETSSLAIPLTTLVTQVTDQATPSTSLGLIVKGFTQLSCQMLYPKLVLRFVQNIAVEDNVLRINILIKQHQNKLVGPLSATSEVPSSGCVPNELKQYLDQPLLDRKSDPIKFWIKCRHFTPVLSDIALKYMICQASSVSSERVASVVNLAVPNERSRLTGDHIKQRVLLICVFALKFRIFFGAYAIGSWDSRGGAVVWPYGLAGECGPALWSLDSGLAGVWVGEGPEDPHPWARAPSRNLCEGAGVPVSVCRSSCSPVL-