Monarch geneset OGS2.0

DPOGS206102
Transcript: DPOGS206102-TA (1521 bp)
Protein: DPOGS206102-PA (506 aa)
Genomic position: DPSCF300028 + 205361-207501
RNAseq coverage: 325x (Rank: top 35%)
Annotation
Heliconius: HMEL005775 | 0.0 | 94.66%
Bombyx: BGIBMGA006828-TA | 0.0 | 93.48%
Drosophila: CG9987-PA | 0.0 | 89.33%
EBI UniRef50: UniRef50_Q9Y3I0 | 0.0 | 82.31% | tRNA-splicing ligase RtcB homolog n=137 Tax=root RepID=RTCB_HUMAN
NCBI RefSeq: XP_969671.1 | 0.0 | 89.92% | PREDICTED: similar to GA22169-PA [Tribolium castaneum]
NCBI nr blastp: gi|332375715 | 0.0 | 90.12% | unknown [Dendroctonus ponderosae]
NCBI nr blastx: gi|91093759 | 0.0 | 89.92% | PREDICTED: similar to GA22169-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [1-506] IPR001233 | 0 | Uncharacterised protein family UPF0027
Orthology group: MCL13677 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206102-TA
ATGGTCGTGCGGCAATATAACGAGGAGTTGAAATATTTAGAGAAGGTGAATCCACACTGTTGGAAGATTAAGAAAGGCTTCCAACCTAATATGAATGTTGAAGGTGTATTCTATGTTAATAATACACTTGAAAGACTAATGCTGGAAGAATTGAAAAACTCCTGCCGGCCGGGTATGACAGGTGGATTTCTTCCTGGTGTAAAACAAATTGCGAATGTGGCAGCTCTTCCTGGGATCGTTGGTCGATCTGTCGGTCTTCCAGACATTCACTCTGGTTATGGATTTGCTATTGGTAATATGGCAGCATTTGACATGTCTAATCCAAAGTCAATTGTGTCTCCTGGTGGAGTGGGCTTTGACATAAATTGTGGTGTTAGACTTTTAAGAACTAACCTACATGAGAAGGATGTTCAACCCATAAAGGAACAATTGGCACAGAGTTTATTTGATCACATACCTGTAGGAGTGGGGTCTAAAGGAATAATACCAATGAATGCAAGGGATATGGAAGAGGCCTTAGAAATGGGAATGGATTGGTCATTAAGAGAGGGCTATGTTTGGGCTGAAGACAAAGAACATTGTGAAGAGTATGGAAGAATGCTAAATGCAGATCCATCTAAAGTAAGTCTAAGAGCAAAAAAGAGAGGTCTGCCACAACTAGGAACTCTAGGAGCTGGCAATCATTATGCAGAAATCCAAGTGGTTGATGAAATTTATGATAAATTTGGTGCGGGAAAGATGGGGCTAGAAAGAATTGGTCAAGTTTGTGTTATGATCCATTCAGGTAGCAGAGGCTTTGGACATCAAGTTGCTACTGATGCCTTAGTACAGATGGAGAAAGCCATGAAAAGAGATCAAATAGAAGTGAATGACAGACAGTTAGCCTGTGCTAGGATAAACTCAGTTGAAGGTCAGGACTACTTAAAAGCAATGGCAGCTGCTGCTAATTTTGCTTGGGTTAACAGAAGTTCAATGACATTCTTAACCAGACAGGCATTTGCAAAACAGTTCAAAATGTCTCCTGATGACTTAGACATGCATGTTATTTATGATGTTTCTCACAATATAGCTAAGATGGAGGAGCATATTGTTGATGGGAAAATAAAAACTCTTCTTGTGCATAGAAAGGGTTCCACCAGAGCTTTCCCTCCACATCATCCATTAATACCTGTAGACTACCAGTTGACCGGCCAGCCTGTTTTGATCGGAGGATCTATGGGTACTTGCAGCTATGTCCTCACTGGAACCCCACAAGGGATGACTGAAACTTTTGGATCCACTTGTCATGGAGCAGGACGAGCACTGTCTCGAGCCAAATCTCGGCGGAATATAGATTATAAGGAAGTTTTAGGAAAGTTAGAGAGTTTAGGAATATCTATAAGAGTTGCATCTCCAAAGCTTGTCATGGAGGAGGCACCAGAATCTTATAAAAATGTGACTGATGTAGTCGACACCTGCCATGCTGCTGGAATCAGCAAAAAGACTGTCAAATTACGACCCATTGCTGTCATAAAAGGATAG

Protein sequence:

>DPOGS206102-PA
MVVRQYNEELKYLEKVNPHCWKIKKGFQPNMNVEGVFYVNNTLERLMLEELKNSCRPGMTGGFLPGVKQIANVAALPGIVGRSVGLPDIHSGYGFAIGNMAAFDMSNPKSIVSPGGVGFDINCGVRLLRTNLHEKDVQPIKEQLAQSLFDHIPVGVGSKGIIPMNARDMEEALEMGMDWSLREGYVWAEDKEHCEEYGRMLNADPSKVSLRAKKRGLPQLGTLGAGNHYAEIQVVDEIYDKFGAGKMGLERIGQVCVMIHSGSRGFGHQVATDALVQMEKAMKRDQIEVNDRQLACARINSVEGQDYLKAMAAAANFAWVNRSSMTFLTRQAFAKQFKMSPDDLDMHVIYDVSHNIAKMEEHIVDGKIKTLLVHRKGSTRAFPPHHPLIPVDYQLTGQPVLIGGSMGTCSYVLTGTPQGMTETFGSTCHGAGRALSRAKSRRNIDYKEVLGKLESLGISIRVASPKLVMEEAPESYKNVTDVVDTCHAAGISKKTVKLRPIAVIKG-
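
The two sequences above can be checked against each other: 1521 bp corresponds to 506 codons plus one stop codon, and translating the transcript should reproduce the protein. The following is a minimal Python sketch of that check, not part of the record itself. It assumes the standard nuclear genetic code; the function name `translate` and the use of `-` for the stop codon (mirroring the trailing `-` in the protein sequence) are illustrative choices.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be '-',
    matching the protein sequence shown in this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five and last three codons of DPOGS206102-TA, copied from above.
print(translate("ATGGTCGTGCGGCAA"))  # start of the protein
print(translate("AAAGGATAG"))        # end of the protein, including the stop
```

Applied to the full DPOGS206102-TA sequence, the result can be compared directly with the DPOGS206102-PA string, including its final `-`.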