Monarch geneset OGS2.0

DPOGS215416
TranscriptDPOGS215416-TA1332 bp
ProteinDPOGS215416-PA443 aa
Genomic positionDPSCF300088 + 719666-725047
RNAseq coverage1442x (Rank: top 9%)
Annotation
HeliconiusHMEL0174408e-11763.59% 
BombyxBGIBMGA012377-TA3e-12154.75% 
DrosophilaGld2-PB2e-7540.32% 
EBI UniRef50UniRef50_Q7PWZ21e-9351.73%AGAP001130-PA n=1 Tax=Anopheles gambiae RepID=Q7PWZ2_ANOGA
NCBI RefSeqXP_393329.22e-10152.19%PREDICTED: similar to CG5732-PA [Apis mellifera]
NCBI nr blastpgi|3800157691e-9953.43%PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like [Apis florea]
NCBI nr blastxgi|3800157693e-9553.43%PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like [Apis florea]
Group
KEGG pathwaydme:Dmel_CG57321e-73 
 K00970 (E2.7.7.19, pcnB)maps-> RNA degradation
InterPro domain[341-405] IPR0020581.1e-10PAP/25A-associated
Orthology groupMCL13133 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215416-TA
ATGAATGAAGGTGCTCCACTGGTGTATGGTAGCTGTGGCTACGAGGGTGGAGTGTACATGGTAGCTGGGCAAATAGACCGTAGTCGCTACCCAGTGGAGGCGCTGGGTGTGGGGCGCTCAGGTCATGGGACTCCTCGCCGATGGCCACGCAAGGAACGCTCAGTCAAACCAGAGAATACTGCATTTTCATATGACTCGGATGATTCCAGCCAGTCCAGCAGCGCCTCTGGATCAAATAAATCAAGTGAAAAAGGTGAGGTGCCCGTCCGTGGTGAGGGTCGTGAGTGGCGCGGCCGGGGTCGCCGGGAGCCCCGCCAGCGTCGTATAGCGCCCGAGCGCTACCTGGCCGCCTCTTACCCCTTCCAGGTGAAGTTCACACCAGACAACCTCCTCAACGGTTCGGAGTGGGACACGCTGTCTCAGGAGATATGGGACAAGTTCGTAAAGTCCCAGCAGACGGAAGAGACGTTCAGGAAGAAGATGAACCTGTGGCGCTACCTGTACATCAGCATCAAATCGATGTTCCCGAGATACGGGTTGTACGTGGTGGGCTCCACCATGTCGGGCTTCGGGCTGGACTCGTCCGACATGGACCTCTGTCTGCACGTGCGGGCGCTGGCCGAGCTCGAGCCCCGCGCGCACGCCCTGCTGCATCTCAACTACATACTCAGCCACATCAGGAGCTTCGACCCCGGTGCGGAATTAATTCAGGCGAAGGTCCCCATACTGAAGTTCCGCGACGAAAGAAACGGACTCCAAGTGGACTTGAACTGTAACAACGTGGTCGGCATCAGGAACACCAACCTACTGTACTGTTACTCCAGGATGGACTGGCGCGTCCGGCCGCTGGTGGCCATCACGAAGCTGTGGGCGCGCGCTCACCGCATCAACGACGCCAGGAGACGCACGCTGTCCTCGTACGCGCTCACACTCATGGTCATTCACTTCCTGCAATGCGGGACCAGTCCGGCAGTGTTGTGTCGCGCGGGTGAGGCCCGGTCGCGGGCTCAGAACAGGTGCTCGCTGGGGGAGCTGTTCCTCAACCTGCTCAAGTACTATGCTGAGTTCCCGTACGAGCAGATGGCGGTGTCGGTGCGCGCTGCTCGCCGCGTGCCCGTGTGGGAGTGTCGCGCGCGGGCCGCCGCCGCCCCGCCGCATCACTCGCCCGCACATTGGAAACTGCTCTGTGTGGAGGAGCCGTTCGACCTGACCAACACGGCGCGGTCGGTGTACGACCCGGAGACGTTCGAACAGATCGTGAGCACCTTCAGGTCCAGCTACACGAGGCTGGCGCGGGGGCTGCGACTCAGAGACGCCTGGCCGCAGCGATGA

Protein sequence:

>DPOGS215416-PA
MNEGAPLVYGSCGYEGGVYMVAGQIDRSRYPVEALGVGRSGHGTPRRWPRKERSVKPENTAFSYDSDDSSQSSSASGSNKSSEKGEVPVRGEGREWRGRGRREPRQRRIAPERYLAASYPFQVKFTPDNLLNGSEWDTLSQEIWDKFVKSQQTEETFRKKMNLWRYLYISIKSMFPRYGLYVVGSTMSGFGLDSSDMDLCLHVRALAELEPRAHALLHLNYILSHIRSFDPGAELIQAKVPILKFRDERNGLQVDLNCNNVVGIRNTNLLYCYSRMDWRVRPLVAITKLWARAHRINDARRRTLSSYALTLMVIHFLQCGTSPAVLCRAGEARSRAQNRCSLGELFLNLLKYYAEFPYEQMAVSVRAARRVPVWECRARAAAAPPHHSPAHWKLLCVEEPFDLTNTARSVYDPETFEQIVSTFRSSYTRLARGLRLRDAWPQR-