Monarch geneset OGS2.0

DPOGS213456
Transcript        DPOGS213456-TA  1548 bp
Protein           DPOGS213456-PA  515 aa
Genomic position  DPSCF300100 - 523345-529562
RNAseq coverage   207x (Rank: top 46%)
Annotation
Heliconius      HMEL016824        8e-140  85.30%
Bombyx          BGIBMGA004482-TA  2e-123  73.31%
Drosophila      CG8207-PA         3e-99   61.00%
EBI UniRef50    UniRef50_G1REB0   7e-93   39.32%  Uncharacterized protein n=6 Tax=Hominoidea RepID=G1REB0_NOMLE
NCBI RefSeq     XP_552528.1       5e-105  62.67%  AGAP011723-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|57909371       1e-103  62.67%  AGAP011723-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|57909371       5e-102  62.67%  AGAP011723-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0009058  7.5e-25  biosynthetic process
                 GO:0016779  7.5e-25  nucleotidyltransferase activity
KEGG pathway     aga:AgaP_AGAP011723  2e-104
                 K00966 (E2.7.7.13) maps-> Amino sugar and nucleotide sugar metabolism
                                           Fructose and mannose metabolism
InterPro domain  [3-200] IPR005835    7.5e-25  Nucleotidyl transferase
                 [375-407] IPR001451  1e-06    Bacterial transferase hexapeptide repeat
Orthology group  MCL13476  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213456-TA
ATGTTGAAGGCTGTGATTCTTATAGGTGGACCACAAAAAGGTACAAGATTCCGTCCTCTATCTTTAGACACACCTAAACCTTTATTCCCTATAGCTGGCCTTCCTTTGATCCAGCACCATATTGCCGCTTGTGTAAAGCTTGGAGAATGCAAAGAAGTTCTTATAATAGGATCATATACCACAACTACTATGGCCCAATTTGTTAGTGATATGCAAAAAGAATACAAAATAATTATAAGATATCTTCAAGAATTCACTCCGCTGGGCACCGGCGGTGGTTTGTACCACTTCAGAGATCAAATCCGTGCGGGCAATCCGACGGCATTCTTTCTATTGAATGGTGATGTGTGTGCCGACTTCCCTCTCAAAGAACTGTGGACCTTCCATGAGAAAACATCACAATCGTTGATAACAATTATGGGCACTGAAGCTACGCGACAGCAGTCAGTTCACTACGGCTGTATAGTTCGAGAGCCAACCAGCAATTCGGTCACACATTATGTTGAAAAACCAAATAGTTATATATCTACATTGATCAATTGTGGGGTGTATGTGTGTTCGTTGCAAATTTTTCACACCATGGCCGATGCATTTCAAAGGAAACAGGAGGGATTTTATAGTGGCAATGGTCAAAATGGTTCACACCCCGGTTATATGTCCTGGGAACAAGATGTGCTAGCGCCTCTAGCAGGGACAAATAAGGTGTTCGCTCTGCAGATAACAATTATGGGCACTGAAGCTACACGACAGCAGTCAGTTCACTACGGCTGTATAGTTCGAGAGCCAACCAGCAATTCGGTCACACATTATGTTGAAAAACCAAATAGTTATATATCTACATTGATCAATTGTGGGGTGTATGTGTGTTCGTTGCAAATTTTCCACACCATGGCCGATGCATTTCAAAGGAAACAGGAGGGATTTTATAGTGGCAACGGTCAAAATGGTTCACACCCCGGTTATATGTCCTGGGAACAAGATGTGCTAGCGCCTCTAGCAGGGACAAACAAGGTGTACGCTCTGCAGGTGACCAATTGGTGGTCGCAGGTCAAGACAGCTGGTTCAGCGATCTACGCGAACAGACATTACCTGGAACTTCATCCGTCAACACCTGCAACCACTTGTCACATAATACCAGATGTATACATACATCCGACCGCTACAGTACATAGCAGCGCTGTTATAGGACCAAACGTGTCTATTGGCGCGGGGGTCACCATACAGGCGGGGGTACGAATAAAAGAGTCCATTGTTCTTAACAACGCGACTGTACATGAACACGCGCTTGTTATGTATACTGTTGTCGGTCAAGAGGCGTCAGTGGGTGAATGGTCCAGGGTTGAAGGCACTCCTTCAGACCCGGACCCCAACAAACCGTTTGCCAAAATGGACAATACACCGTTGTTCAACAGCGACGGAAGACTTAACCCCTCGATAACTATACTTGGGGCGGGTGTGGTGGTGCCGGATGAGATGATTCTTCTTAATTCCATAGTACTACCGCACAAACACCTGACGAGGAGCTTTAAACATGAGATCATATTATAG

Protein sequence:

>DPOGS213456-PA
MLKAVILIGGPQKGTRFRPLSLDTPKPLFPIAGLPLIQHHIAACVKLGECKEVLIIGSYTTTTMAQFVSDMQKEYKIIIRYLQEFTPLGTGGGLYHFRDQIRAGNPTAFFLLNGDVCADFPLKELWTFHEKTSQSLITIMGTEATRQQSVHYGCIVREPTSNSVTHYVEKPNSYISTLINCGVYVCSLQIFHTMADAFQRKQEGFYSGNGQNGSHPGYMSWEQDVLAPLAGTNKVFALQITIMGTEATRQQSVHYGCIVREPTSNSVTHYVEKPNSYISTLINCGVYVCSLQIFHTMADAFQRKQEGFYSGNGQNGSHPGYMSWEQDVLAPLAGTNKVYALQVTNWWSQVKTAGSAIYANRHYLELHPSTPATTCHIIPDVYIHPTATVHSSAVIGPNVSIGAGVTIQAGVRIKESIVLNNATVHEHALVMYTVVGQEASVGEWSRVEGTPSDPDPNKPFAKMDNTPLFNSDGRLNPSITILGAGVVVPDEMILLNSIVLPHKHLTRSFKHEIIL-
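
The transcript and protein lengths in this record (1548 bp and 515 aa) can be cross-checked with simple arithmetic. The sketch below assumes the transcript is a pure coding sequence with no UTRs, ending in a single stop codon; the record does not state this explicitly. The function name `cds_length_consistent` is hypothetical and used only for illustration.

```python
def cds_length_consistent(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Check that a coding sequence length matches its protein length.

    Assumption (not stated in the record): the transcript contains only
    coding sequence, so its length is 3 nt per residue, plus 3 nt for
    the stop codon when has_stop is True.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Values copied from the record: 1548 bp transcript, 515 aa protein.
print(cds_length_consistent(1548, 515))
```

With the stop codon counted, 3 x (515 + 1) = 1548, so the two reported lengths agree under this assumption.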