Monarch geneset OGS2.0

DPOGS209891
Transcript  DPOGS209891-TA  2142 bp
Protein  DPOGS209891-PA  713 aa
Genomic position  DPSCF300049 - 407666-411842
RNAseq coverage  378x (Rank: top 32%)
Annotation
Heliconius  HMEL010486  0.0  64.36%
Bombyx  BGIBMGA000189-TA  0.0  65.69%
Drosophila  CG5608-PA  3e-162  43.89%
EBI UniRef50  UniRef50_Q08AM6  8e-165  40.90%  Protein VAC14 homolog n=66 Tax=Coelomata RepID=VAC14_HUMAN
NCBI RefSeq  XP_968541.1  0.0  48.11%  PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastp  gi|307198847  0.0  47.91%  Protein VAC14-like protein [Harpegnathos saltator]
NCBI nr blastx  gi|307198847  2e-179  47.91%  Protein VAC14-like protein [Harpegnathos saltator]
Group
Gene Ontology  GO:0005488  6.3e-38  binding
KEGG pathway  ani:AN5527.2  1e-54
    K10875 (RAD54L, RAD54) maps-> Homologous recombination
InterPro domain  [473-652]  IPR021841  5.2e-66  Protein of unknown function DUF3434
    [11-581]  IPR016024  6.3e-38  Armadillo-type fold
    [25-582]  IPR011989  5.6e-33  Armadillo-like helical
Orthology group  MCL14355  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209891-TA
ATGACGGATCGAGATTATGCACCGTTGAGCTCTGCTTGTGTCCGCGGACTATGTGATAAATTATATGACAAAAGAAAATTTGCAGGCGTTGAGATAGAAAAAATGGTGAAGGATTTTAACGATGCGAATAATACAAGTCAAATAAAACGTTTGATTCGTGTGCTCGGTCAGGATCTAATGTCATCCACCAATCCCAATGTTAAAAATGGAGCTCTTATGGGACTCTCGTCTGTTGCAGTTGGTTTGGGAAAGGGAAGTGTAGATTATATGGGAGAATTGATTCATCCAATCATAGCCTGTTTGGGCGAGAGTGAAGCCAGGGTGAGATACTCTGCTACAGAAGCATTGTTCAATGTATTGAAGATTGTGCGCAGTGCGTCACTCACTCATTTCCCATTAGTATTTGATGCATTGGCTCGACTGGCCGCTGACCCAGAACTACAAGTCCGTCAGGGAGCGGAGCTGCTTGATAAACTCGTCAAAGACATAGTGAGTGAGAGTGGTACAGTGGACGTGTCGCTGGTGGTGCCGCTGGTCCGGGAGCGACTGTATGCGCGGTCGGCGGCGGCGCGCGTGTTCGGCGTGGGCTGGCTGAGCGCGCTGGACGCCTCGCCCGCCCTGGGACTCCGGGCTCACCTGCCGCTGCTGCTCGATCCACTGTTCACCGTTCTTGATGACCCCAACCCGGAAATACGTCGCATGTGCGACGTGCAACTGAACGAGTTCCTCCGAAGCATCAAGAAGGACCCCTCGGAGGTGGACTTCGAGTCTATGATCAACATCCTCATCACACACGCCCAGTCCACCGAGGAACTGCTGCAGCTGACGGCGCTGACATGGCTGAAGGAGTTCGTGAACGTGTGCGGGCGTCGAGTGCTGCCCTCGGCCTCCGGCGCCCTGGCCGCCGCGCTGCCCTGTCTGGCGCTCGCCGACCACTCCGACATGAGGACCAAAATCCGAGAAACGGCGGCGGCTGTGAACCATCAGCTGATAAAGCTGGTCGTTGAAAAAACAGAGGGCTCTCATGAGAAGCGAGCCGAGGGTGACGACACGAGGGCTTGTCTCAACTTGGAGGCGGTGGTGGGAGTTTTGACACAGATGTTACACCACAGCTCCCTGCATACCAAGGTCGCCGCTCTCGACTGGATCCTACATCTATATAACAAGTTACCAAACGAGATGTTTCTTCAGACGGAACGTGTGTTTTTGAGTGTGGTGGGCAGCTTGGCTGACCCGGCGGACGACGTGGTGCGCCGCGCCCTAGCCGTCCTCGCTGAGATATGTTCCTGCCACACCGCCACCACTACTGCCACCACCACCACGAGCAGCGACACCGTTACCACCACCACAAGTGACCTTGAATCCAGTCCGTACTATCACAAATTCCTGAAAGCTCTGTTGAGACTGCTCGCAGCCGATGAGAACCTCCTAGAGGACAGGGGATCGTTTATCATAAGACAGCTGTGTGTGCTGGTGGGGGCGGAGGCTGTGTACCGCGGCGTAGCGCTGTCTCTCCGCGGGGAGCGCGAGCTCCGATTCGCGGCTCGCCTCGTGGACGTGCTGGACACTCTGCTGCTGACGGCCGCGGAGCTGCATCACCTGCGCCGCAGTCTGAGAGCCTTCTCCGACCCGGCGACGGTGTCCCTGTTCGAGACGCTGTACGAGTGCTGGAGCCACAGCCCGGTGGCGCTGCTGGCTCTCTGTCTCCTCACACACAACTACCAGCACTGCAACACGCTCATATCTACATTTGGGGACTTAGAGATAACGGTGGATTTCCTCACCGAAGTGGACAAACTGGTCCAGTTGATCGAGTCGCCGGTGTTCGCCTATCTCCGCCTGGAGTTGTTGGACGACGAGCGCAGTCGCCCGCTCCGTTCGGCTCTTTTCGGCCTCCTGATGTTGTTGCCTCAGAGTGAAGCGTTTCACTCTCTCCGCCGACGCCTGCACTGCGCCCCTCCCCCTCCCCCTCTCACAGCTCGCCCCCAGCAGACGGAACC
CCCAGCTCCCCAGTCCCCCCCTGGACTTGATTTCGACGCGTTGCTGGCGACCTTTCAGCGCGTTCAGGCAGCTCACCGTGACTACCGTACGCTGGAGAGACGACTCAGCCGCGCTAAGCTCTTTGATAGAACGTACTCATAG

Protein sequence:

>DPOGS209891-PA
MTDRDYAPLSSACVRGLCDKLYDKRKFAGVEIEKMVKDFNDANNTSQIKRLIRVLGQDLMSSTNPNVKNGALMGLSSVAVGLGKGSVDYMGELIHPIIACLGESEARVRYSATEALFNVLKIVRSASLTHFPLVFDALARLAADPELQVRQGAELLDKLVKDIVSESGTVDVSLVVPLVRERLYARSAAARVFGVGWLSALDASPALGLRAHLPLLLDPLFTVLDDPNPEIRRMCDVQLNEFLRSIKKDPSEVDFESMINILITHAQSTEELLQLTALTWLKEFVNVCGRRVLPSASGALAAALPCLALADHSDMRTKIRETAAAVNHQLIKLVVEKTEGSHEKRAEGDDTRACLNLEAVVGVLTQMLHHSSLHTKVAALDWILHLYNKLPNEMFLQTERVFLSVVGSLADPADDVVRRALAVLAEICSCHTATTTATTTTSSDTVTTTTSDLESSPYYHKFLKALLRLLAADENLLEDRGSFIIRQLCVLVGAEAVYRGVALSLRGERELRFAARLVDVLDTLLLTAAELHHLRRSLRAFSDPATVSLFETLYECWSHSPVALLALCLLTHNYQHCNTLISTFGDLEITVDFLTEVDKLVQLIESPVFAYLRLELLDDERSRPLRSALFGLLMLLPQSEAFHSLRRRLHCAPPPPPLTARPQQTEPPAPQSPPGLDFDALLATFQRVQAAHRDYRTLERRLSRAKLFDRTYS-
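
The lengths stated for this record agree with each other if the transcript is taken to be coding sequence only: 713 residues plus one stop codon make 714 codons, which is 2142 bp. The sketch below shows one way to check that relationship for any transcript/protein pair in FASTA format. The helper names `read_fasta` and `lengths_agree` are hypothetical and are not part of the database. The check assumes there is no UTR in the transcript and that a trailing `-` or `*` in the protein marks the stop codon.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict of {record id: sequence}.

    Sequence lines belonging to one record are concatenated, so a
    sequence wrapped over several lines is read as a single string.
    """
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record id is the first whitespace-delimited token.
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def lengths_agree(transcript, protein):
    """Return True when len(transcript) == 3 * (residues + 1 stop codon).

    Assumption: the transcript is CDS only, and a trailing '-' or '*'
    in the protein string stands for the stop codon.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


# Toy records for illustration only; these are not the monarch sequences.
toy = """>toy-TA
ATGAAA
TAG

>toy-PA
MK-
"""
seqs = read_fasta(toy)
print(lengths_agree(seqs["toy-TA"], seqs["toy-PA"]))  # True: 9 == 3 * (2 + 1)

# The arithmetic for the lengths stated in this record.
print(3 * (713 + 1))  # 2142
```

To apply the check to this record, paste the two FASTA entries above into one string and pass the `DPOGS209891-TA` and `DPOGS209891-PA` sequences to `lengths_agree`.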