Monarch geneset OGS2.0

DPOGS200014
TranscriptDPOGS200014-TA1254 bp
ProteinDPOGS200014-PA417 aa
Genomic positionDPSCF300225 - 233892-239358
RNAseq coverage31x (Rank: top 75%)
Annotation
HeliconiusHMEL0045227e-8571.12% 
BombyxBGIBMGA007977-TA4e-1225.82% 
DrosophilaCG14142-PB1e-8445.79% 
EBI UniRef50UniRef50_Q7QC294e-11553.33%AGAP002421-PA n=2 Tax=Anopheles RepID=Q7QC29_ANOGA
NCBI RefSeqXP_001601488.16e-11551.07%PREDICTED: similar to ENSANGP00000014790 [Nasonia vitripennis]
NCBI nr blastpgi|3071966801e-11452.27%UPF0526 protein [Harpegnathos saltator]
NCBI nr blastxgi|3479678349e-11353.33%AGAP002421-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
Orthology groupMCL16134 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200014-TA
ATGAACACTATAATAAGGAAACAGGCGCTAGGAACATCGCAGGAAGAGTTTTATGAACGTGTGTTATACGCAAGAAACCGGCAAGTTGGTGCTCGACCTTCAGTGGTTGGTGGCAAACCTATTACTGAAGATTTGGCTATTGAGCTTCGTATAACAGCCTTTGGAACTGCGTCATGTCCGCCTCGTGGGGAGTGGATACACACTCCACTTGTGATGAGGCCACCTGATCATCCTTTAGCGTATGGCCTGGCAGCTCCCAGAAATGGAACTCGATCCCTACTGTCGGGTCTTCAGGCTCACATCATCAAATGGCTTCTGTTCGACTCCCGGCCTCTGACAAAAGATAACAAATCTGTGGAACCACCTGACAGTTACCTTCGTCCATCCGAAGAACGTCAAGAGGAAGCACTGTGGCGGGCGTGCAGTGAAGTTATCTGGCGCTGCGGTGGGGGTTTTAATGCCCAAACCGATACCAAGGTTACGGTGACACTTCCGACCAATCAAGTATACATACAACATAGTTCACAATATTACCAAGATGGAATCACTGAGATGCTGCACTTGTTTGAATTCAAAAGTTTGGAGGACCTGCAGATTTTTTTGAAGCGATATTTGTATTTGTTTCAGTCTGAAGACGGCTCCGGATCCTTGTTGCTGCTCTACGCCTGCATTTTATCAAGGGGTTGCGAGAATGTTAAAAAAGATCTTGATGGTAAACTGACCTATTTAGTTTCCACGCAAGTTGAAGGGTCTCTTAACGTGACGACTCTCCTACTCACGGGCCGTGCTACACCTTATTTGCATAACGGAGTACAATATGTGGGCGATGAAGATCATTATGCAATGCCGCAATTTGGCGTTCTATCCAGAAGTTCAGTAGGTCTTCTCGTATGGTACGGAAATGAGGAAAACGTCGGCTGCAACGTATCCAAACAGTACCCTGGATCTCGTCTGAAAACACCAGCAATGCCTATTTGGGTAACAAGTTGCTCAGGACACTATGGGGTACTGTTCAATACTAATCGCGAGCTTCTGAGAAACTATCACGCTGAAAGAAGGTTTGATATTCACTATTACACCTGTGGTGGATGCCACGTTCTCTTGAATGTGGACACTCGAGCTCACGAAGACATGGTGCAATTGAGAAATGATGACATCAGCGCCACACCGCTTGAGAAACTCATTCACACTAAGTGGCAAGACGCCAAGATCACTTGGTCTGGCCCCGTGCCCTTTGCGGATTCTCCCAACTAG

Protein sequence:

>DPOGS200014-PA
MNTIIRKQALGTSQEEFYERVLYARNRQVGARPSVVGGKPITEDLAIELRITAFGTASCPPRGEWIHTPLVMRPPDHPLAYGLAAPRNGTRSLLSGLQAHIIKWLLFDSRPLTKDNKSVEPPDSYLRPSEERQEEALWRACSEVIWRCGGGFNAQTDTKVTVTLPTNQVYIQHSSQYYQDGITEMLHLFEFKSLEDLQIFLKRYLYLFQSEDGSGSLLLLYACILSRGCENVKKDLDGKLTYLVSTQVEGSLNVTTLLLTGRATPYLHNGVQYVGDEDHYAMPQFGVLSRSSVGLLVWYGNEENVGCNVSKQYPGSRLKTPAMPIWVTSCSGHYGVLFNTNRELLRNYHAERRFDIHYYTCGGCHVLLNVDTRAHEDMVQLRNDDISATPLEKLIHTKWQDAKITWSGPVPFADSPN-