Monarch geneset OGS2.0

DPOGS203817
TranscriptDPOGS203817-TA1152 bp
ProteinDPOGS203817-PA383 aa
Genomic positionDPSCF300010 + 2219416-2220853
RNAseq coverage153x (Rank: top 53%)
Annotation
HeliconiusHMEL0133284e-9849.86% 
Bombyx% 
DrosophilaCG10189-PA2e-3230.31% 
EBI UniRef50UniRef50_B3RVD12e-4030.28%Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RVD1_TRIAD
NCBI RefSeqXP_002112006.13e-4130.28%hypothetical protein TRIADDRAFT_55610 [Trichoplax adhaerens]
NCBI nr blastpgi|3323738486e-4932.17%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323738483e-4931.90%unknown [Dendroctonus ponderosae]
Group
KEGG pathway 
InterPro domain[193-288] IPR0194072.3e-22Thiouridylase, cytoplasmic, subunit 2
[38-237] IPR0147291.6e-09Rossmann-like alpha/beta/alpha sandwich fold
Orthology groupMCL14884 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203817-TA
ATGAAGGACAAAGTTTGTAGGAAATGTAATGGTCCTGGTACTGTATTCGTAAGAAAAGAATATTGTTACTGTAATGATTGTTTTATAACTAACACTAATCACAAATTCCGAGCATGCATTGGTAAAAATAAAAAACTAACCTCAAATGAAAAGGTCCTCGTTTGTTTATCTGGAGGAAAAAGCTCAACAGTTTTATTGGATTTGATTTACAATGGAATTAACCTAGACAGCCATAAAAAACTTAGGATAGTGCCATTTTTCATTCACATAACAGGCAAGTATTTGAGAAGAATAAAAAAAGAAAGTGAATCGAAAAAAACCACCGACATGATTATAGATCAATGCCAAAAATACAATTTTGATCTATATGTGGTTAATATTCAGGAATATAAATCTAATGAAGACATCAGTTACTGTACAAACTCATACTCCTCAGCAACTTTGTCAGCGAAAAATTTAAACTTCACAACCACAACTGGCCAAGATGTACTGACTAAAATTAAACACAATCTTTTTATTAGAATGTCTAAACAATTAGGTTGCCGATTTGTATTGACTGCTGAAACCACTACACTATTGGCAATGAAATTATTATCTAGTATTGTCATTGGAAGAGGATCACAAGTCGAAAACGATATTGGGTTTGCAGATAATCGAGATGGTAATGTTGAGATTTTAAGACCTATGAGAGACATCACCAATGAGGAAATAGACATTTATCTTAACATAAAGCAAATGTATGTTGATCCTAAGGATAGTGATCAGGAAATTAGTTTGCAAGCAGCTATTAGAGATTTTGTTTTGGATCTGCAAGAAAATTATCCATCTACGATTTCAACAGTCTGTAAAACTGCTGACAAGCTTGGCTCTGTAAATGGGAATAATAATATAAAAAAATGTCATGTCTGCCAAAGTACAATTAATTTTAAAGACTCTAAGTTGACAGCTGTTGAAGCCACTATATTTTCAAGAATAGTATCCAGTGAAAAGAGTCCTGAAAGTGCTTCAGACCTCCCATTCAATACTAAAGACAACACAATGTTTCCTTATATTTTTGATAGATTTTGTTATTGTTGTAGTAGAAATTATTTAGAAACAAAAGGGTCTGATCTTAACATGTTTTTGTCACAGAAAATTGAAAAGGAATCATAA

Protein sequence:

>DPOGS203817-PA
MKDKVCRKCNGPGTVFVRKEYCYCNDCFITNTNHKFRACIGKNKKLTSNEKVLVCLSGGKSSTVLLDLIYNGINLDSHKKLRIVPFFIHITGKYLRRIKKESESKKTTDMIIDQCQKYNFDLYVVNIQEYKSNEDISYCTNSYSSATLSAKNLNFTTTTGQDVLTKIKHNLFIRMSKQLGCRFVLTAETTTLLAMKLLSSIVIGRGSQVENDIGFADNRDGNVEILRPMRDITNEEIDIYLNIKQMYVDPKDSDQEISLQAAIRDFVLDLQENYPSTISTVCKTADKLGSVNGNNNIKKCHVCQSTINFKDSKLTAVEATIFSRIVSSEKSPESASDLPFNTKDNTMFPYIFDRFCYCCSRNYLETKGSDLNMFLSQKIEKES-