Monarch geneset OGS2.0

DPOGS206947
TranscriptDPOGS206947-TA1380 bp
ProteinDPOGS206947-PA459 aa
Genomic positionDPSCF300001 - 366453-372738
RNAseq coverage584x (Rank: top 22%)
Annotation
HeliconiusHMEL0143690.074.34% 
BombyxBGIBMGA012953-TA2e-14989.71% 
DrosophilaTango5-PA2e-10952.52% 
EBI UniRef50UniRef50_Q0IEV82e-11850.23%Vacuole membrane protein n=5 Tax=Culicidae RepID=Q0IEV8_AEDAE
NCBI RefSeqXP_969557.12e-13551.75%PREDICTED: similar to vacuole membrane protein [Tribolium castaneum]
NCBI nr blastpgi|910854573e-13451.75%PREDICTED: similar to vacuole membrane protein [Tribolium castaneum]
NCBI nr blastxgi|910854571e-13351.96%PREDICTED: similar to vacuole membrane protein [Tribolium castaneum]
Group
Gene OntologyGO:00036763.8e-09nucleic acid binding
KEGG pathway 
InterPro domain[40-70] IPR0030343.8e-09DNA-binding SAP
Orthology groupMCL14154 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206947-TA
ATGACAAAATTAGCTGGCACAAGGCGAGCACGCAACCAAGCTACATCTCTTGCTATACCACGTCAAAACTCTATAACTGATAATAGCAAGACTCGTCAAAATTTAAGTCCAGAAGCTCTGGCGTCTATGAGTAAGGAACAACTCAGAGCTGAATGCAGAAAGAGAGGCCAACGGACTACAGGCAACAAAAATGAACTGCTATCCCGTTTGGGATACTACAAGCTGGCGGCCGCAGTCCAAGTAGAAACGGAGCAGCGTTCGAGGAACGGTTTGCACTCACCTCCTAAGGAGACCAAAACGAAAGCCGCTTTAATGGATACGGAAAGTTTGGTATTATGGCGGAAACCTTTTACGACGCTCGAATATTTTTTTAGAGAACTCCTTATATTATGCTCGACGGGCTTAAAGAGATTGTTAGCATACAAAGCACTAGCCTTATTTATATTTCTAACAATCATCAATATCGCCTTCTCCTACCACATCAAGGGCCCCCACCAACCTTATGTGCAGGAGGCAATTCAGACCCTGGCTTGGTGGGGATGGTGGGTGATTCTGGGCGTGTGTAGTTCAGTCGGTCTCGGTACCGGCCTGCACACATTCCTATTATACCTCGGGCCCCATATAGCGAGGGTCACACTGGCAGCGTACGAATGCGGAGGCTTGAACTTCCCGTCGCCGCCTTACCCCAATGATATAATATGTCCGAGCGAGGTGGATCCTAACAATGCAGTGTCTATATGGAATATAATGGCCAAGGTCCGTATAGAGTCCATGATGTGGGGCATCGGTACGGCTTTGGGTGAACTGCCCCCATACTTCATGGCACGAGCCGCTAGGATCTCCGGTGGTAGCGTGGAGGGGCTCAACGAAAAAGACGATTCTAGGACCGGGAGGGCCAAGGTGATGATTCAGAAGTTGGTGCAGAAGGTTGGTTTCGCTGGTATATTGGCTTGTGCGTCTATACCAAACCCGTTGTTCGATTTGGCCGGTTTGACTTGCGGACATTTTCTCGTACCGTTTTGGACATTCTTCGGAGCCACGGTCCTCGGTAAGGCCGTTGTCAAGATGCATCTGCAGAAGATGTTTGTCATCGTCGCCTTCAATGAGACTCTAGTCGGACAAGCCCTATCCTGGGTTGAAAAAATACCGTACGTAGGACCGAAGTTGGAAGCTCCATTGCTAGAATTCCTGAGGAATCAAAAGGCTCGTTTACACAAGAATGATAATACGTCACAAGAGAACCAAGGCTCAATACTGTCGAGCATCCTGGAGAAATTTGTATTGGCAATGGTGTTGTACTTCATCGTGTCCATAATAAATGCACTCGCACAAAACTACAACAAACGCAGCACCAAGAAGAAAGGCAAAAAGAAGGAGTAA

Protein sequence:

>DPOGS206947-PA
MTKLAGTRRARNQATSLAIPRQNSITDNSKTRQNLSPEALASMSKEQLRAECRKRGQRTTGNKNELLSRLGYYKLAAAVQVETEQRSRNGLHSPPKETKTKAALMDTESLVLWRKPFTTLEYFFRELLILCSTGLKRLLAYKALALFIFLTIINIAFSYHIKGPHQPYVQEAIQTLAWWGWWVILGVCSSVGLGTGLHTFLLYLGPHIARVTLAAYECGGLNFPSPPYPNDIICPSEVDPNNAVSIWNIMAKVRIESMMWGIGTALGELPPYFMARAARISGGSVEGLNEKDDSRTGRAKVMIQKLVQKVGFAGILACASIPNPLFDLAGLTCGHFLVPFWTFFGATVLGKAVVKMHLQKMFVIVAFNETLVGQALSWVEKIPYVGPKLEAPLLEFLRNQKARLHKNDNTSQENQGSILSSILEKFVLAMVLYFIVSIINALAQNYNKRSTKKKGKKKE-