Monarch geneset OGS2.0

DPOGS216101
TranscriptDPOGS216101-TA873 bp
ProteinDPOGS216101-PA290 aa
Genomic positionDPSCF300182 - 251736-259501
RNAseq coverage985x (Rank: top 13%)
Annotation
HeliconiusHMEL0080111e-8794.89% 
BombyxBGIBMGA009216-TA5e-7492.37% 
Drosophila% 
EBI UniRef50UniRef50_E0V9C32e-7952.58%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0V9C3_PEDHC
NCBI RefSeqXP_975168.12e-8456.25%PREDICTED: similar to Toll-interacting protein [Tribolium castaneum]
NCBI nr blastpgi|910815653e-8356.25%PREDICTED: similar to Toll-interacting protein [Tribolium castaneum]
NCBI nr blastxgi|910815651e-7955.75%PREDICTED: similar to Toll-interacting protein [Tribolium castaneum]
Group
Gene OntologyGO:00055153.2e-21protein binding
KEGG pathwayhsa:544728e-65 
 K05402 (TOLLIP)maps-> Toll-like receptor signaling pathway
InterPro domain[55-183] IPR0089733.2e-21C2 calcium/lipid-binding domain, CaLB
[237-290] IPR0090601e-14UBA-like
[250-289] IPR0038921.4e-13Ubiquitin system component Cue
[66-143] IPR0000088.6e-08C2 calcium-dependent membrane targeting
Orthology groupMCL16518 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216101-TA
ATGACCTCCACAGTGCCTAACGAAAGAAATGAAGAACGCCGACGCAGAGTACTACTAGGGCCCTTGCCAGCAGGTTTCTTAAGAGCTGATGGAACTACGGATACGATGGATACAGACTATCAAGCAGCCCTGGCCTTACAGCAGCAATTATGTGGTGCTACAATGCCTCCAGCGGGACCTCCTCTCACAGCCAGGCTAAGCGTCACAATAGCACAGGCGAAACTCGTGAAGAATTATGGTCTAACCCGTATGGATCCCTACGTCAGAGTTCGCGTGGGTCATTGTATATACGAGACTCAGACGGATCCCAGCGGTGGAAAAACGCCGCGCTGGAACAAAGTTATACATTGTCTTCTACCCCCCGGCGTGAACTCGCTGTACCTGGAGATCTTTGACGAATGCTCCTTCACCATGGACGAGCTGATTGCTTGGACACATATCTCCATACCTCAGGCCGTGCTTAATGGCGAGACTCACGAGGACTGGTATCCTCTGAACGGTAAACAGGGTGACGGTCTGGAGGGGATGATAAACCTCGTTCTCAGCTATTCGGTGGGTCCTGCGGCGATCACCACCTACCCACCGGTGCTGGTGGTGCCGAGCACTGGTCTAGGCTACGCAGCCATGCCAATGTACCCGGCCCCTGTACACGCTATGCCGCAGCAACAGCAACAGATGACGCAGCAGCAACAAATGGCGCAGCAACAAATGGCGCAGCAACAGCAACAGCAACAGCCGATTACCGCGGAGCAACTACAACAGATCGAGGAAATGTTCCCGAGTATCGACAAGGAAGTTGTTAAATCAGTATTGGACGCGAACCGCGGGAACAAAGACGCTGCTATCAACTCCTTGCTGCAGATGTCTGAGTAA

Protein sequence:

>DPOGS216101-PA
MTSTVPNERNEERRRRVLLGPLPAGFLRADGTTDTMDTDYQAALALQQQLCGATMPPAGPPLTARLSVTIAQAKLVKNYGLTRMDPYVRVRVGHCIYETQTDPSGGKTPRWNKVIHCLLPPGVNSLYLEIFDECSFTMDELIAWTHISIPQAVLNGETHEDWYPLNGKQGDGLEGMINLVLSYSVGPAAITTYPPVLVVPSTGLGYAAMPMYPAPVHAMPQQQQQMTQQQQMAQQQMAQQQQQQQPITAEQLQQIEEMFPSIDKEVVKSVLDANRGNKDAAINSLLQMSE-