Monarch geneset OGS2.0

DPOGS208117
TranscriptDPOGS208117-TA2706 bp
ProteinDPOGS208117-PA901 aa
Genomic positionDPSCF300154 - 97019-101049
RNAseq coverage2777x (Rank: top 4%)
Annotation
HeliconiusHMEL0122680.092.93% 
BombyxBGIBMGA006770-TA0.087.49% 
DrosophilaEct4-PI0.066.79% 
EBI UniRef50UniRef50_D6WZH30.068.99%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZH3_TRICA
NCBI RefSeqXP_394430.30.075.65%PREDICTED: similar to Ect4 CG7915-PB, isoform B, partial [Apis mellifera]
NCBI nr blastpgi|3800302350.075.83%PREDICTED: LOW QUALITY PROTEIN: sterile alpha and TIR motif-containing protein 1-like [Apis florea]
NCBI nr blastxgi|3287797410.074.85%PREDICTED: LOW QUALITY PROTEIN: sterile alpha and TIR motif-containing protein 1 [Apis mellifera]
Group
Gene OntologyGO:00054881e-43binding
GO:00055151.2e-17protein binding
GO:00312241.2e-13intrinsic to membrane
GO:00071651.2e-13signal transduction
GO:00048881.2e-13transmembrane receptor activity
GO:00450871.2e-13innate immune response
KEGG pathway 
InterPro domain[275-557] IPR0119891e-43Armadillo-like helical
[271-581] IPR0160246.5e-27Armadillo-type fold
[558-639] IPR0109931.2e-17Sterile alpha motif homology
[566-633] IPR0016609.4e-15Sterile alpha motif domain
[566-631] IPR0115106.6e-14Sterile alpha motif, type 2
[706-864] IPR0001571.2e-13Toll-Interleukin receptor
[558-635] IPR0137613e-13Sterile alpha motif-type
Orthology groupMCL13667 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208117-TA
ATGGCTTCTCGTCTAGTCGCATGGACCAGTAGGATATTCCGTCGCGGCGGTTCGCCGGGGGCGGTGAGCTCATTATCTCAGAGCGGTTATCGTTTACCGCACCATGCTCGTGTCAGTCATCGTTTGACCGCCGTGCGTGCCAGCACGCGGAAGTCAATTATTGCGCATGCGCCTTTGGGTACACGAATAGACGGCTTTAAATATAGAAACGGAATGTCTAACGGCGCGGGTTCGGCCCCGTGGCCGGTACACAGAAACGTCCTGTCGCGTTTCCCCCCTAAGCCGACGTATCCTTCGGAAAAGCGTTCTCTACAAGCGGGGGAAGTATCAGCTCAGGAGGCGAATAACATGTCCGCGACCAGTTCGAGATTACAAACGGAGGCTTTCAGTGCTGAGAAAAAGGCGATGGCATCATCACAGGCGAGACAGACGTTCACTTCCAGTGGAATTTTCAGTCACAAAGAACATTCAAGCGTCGCTCACTCCAACATGACCATATCCAGCAAGAATCTTAGCACTAAATCAACATTACTATCGTCTCAAATGAGTCAGCTGTTGAATGGGACAGTGAAACCGGGAGACGAAGACCTCTCCAACCTAACATTCGAAGATTTAGACAAATTGGATGCTAAGTCGAATCAGAAGGACGTAGATTTAGCGATTCAAAAATATTCACACAGGATGAACGCTTTCATAACGGCCATAAAAAATAATCAGATAGACATGAAAAACGCCTGCGTCCACTTCATGAAGTTAAACGAGATGGTCAAAAGAGCATGGGCTGTGCCTACATACGGCCATGAGTTAGGGTACTCGTTGTGCAACACGCTGAGATCGTCCGGCGGTTTGGACATTTTGATGGCGAACTGCTTGGAATCCAATAACCCGGATCTTCAATTCTGTTCCGCTAAACTATTGGAGCAATGTCTCACCACTGAAAATAGAGATCATGTAGTGCAAAATGGTCTCGAAAAAGTCGTTAACGTGGCCTGTGTGTGCACGAAGCATTCGAATTCAGTCGATCACTCAAGAATAGGTACTGGGATCTTGGAGCATTTGTTCAAACACAGCGAAGGTACTTGCAGTGATGTTATCAAGCTGGGAGGTTTAGACGCCGTTCTGTTTGAATGTAGAAAAAATGACGTGGAAACTCTGAGGCACTGCGCAACAGCTCTGGCGAACTTATCACTATACGGCGGCGCTGAAAACCAGGAAGCGATGATAAAAAGAAAAGTACCCATGTGGCTGTTCCCTCTAGCCTTCCACAACGACGACAACATCAAATACTACGCGTGTTTAGCCATCGCTGTTTTGGTAGCCAACAAAGAAATAGAAGCAGCCGTCTTGAAATCCGGAACCTTGGATCTGGTTGAACCCTTTGTTACTTCACACAACCCGTCGGAGTTCGCCCGATCAAACCTAGCGCACGCTCACGGTCAGAGTAAGAACTGGCTTCAAAGATTAGTCCCGGTTTTGAGTTCAAAGAGGGAAGAAGCGAGGAACCTGGCCGCCTTCCACTTCTGTATGGAGGCTGGTATTAAAAAGCAGCAAGGGAATACAGAGATATTTAGAGAAATAGGAGCTATAGAATCCTTAAAGAAAGTAGCCAGCTGTCCGAATGCTGTTGCGTCGAAATACGCAGCGCAGGCTTTAAGACTAATTGGAGAAGAGGTACCACATAAACTGTCCCAACAAGTACCTTTGTGGTCGATAGAGGACGTCAGGGAGTGGGTCAAACAAATAGGCTTCTCTGAATACGCGAACAATTTCTATGAAAGTAGAGTAGATGGTGACCTTTTGTTACAAATAACTGAAGCTAATCTCAAAGAAGACATAGGTTTAAATAACGGAATCAAACGTAAAAGATTCACGCGAGAACTTCAGCAATTAAAAAAAATGGCGGACTACAGTTCACGTGACACGGGGAGCCTTAACGAATTTCTACAGAGCATTGGTCCAGAATACACGATATACACGTATTCAATGTTGAATGCTGGTGTCGACAAGGAATCCATCCGTGGCCTGAGTGACGAACAGCTGGAAAATGAATGCAGAATAGGCAACAGTATACACCGGCTACGAATACTGAACGCTATACGAGCCTATGAAAGCACATTGCCTAGCAAAGGCGAAGAGAATATGGAGAAGAATTTGGACGTTTTCGTTAGTTACCGGAGATCAAACGGCTCACAGTTGGCCAGTTTGTTGAAAGTTCACCTACAACTGCGAGGTTTCACCGTTTTCATAGACGTGGAGCGATTAGAAGCTGGGAAATTCGATAATAATCTCCTCCAGAGTATACGCCAGGCGAAGCATTTCCTTCTAGTGTTAACCCCAAACGCACTGGAGAGGTGCAAACATGATAATGAACAAAAAGACTGGGTCCATCGGGAGATAGTGGCAGCATTGCAGTCACAGTGCAATATAGTTCCAATTATCGACAACTTCCAATGGCCGGAACCGGAAGAGTTACCGGAAGACATGCGAGCCGTTTGTCACTTCAATGGCGTCAGGTGGATACATGATTACCAGGACGCCTGTGTCGAGAAACTTGAAAGTTTCCTACGCGGCAAGTCGAACTTAGCAACTCGTCTGGAGGGTCCGCTCCGCGGTCGGGACGTGCCCACTCCCGGGACAGCCGCCATGCGACCACCAAACTATCAACGTATGGTCTCCACTGAGAGCAGGGGCAGTGATAAAGATTGA

Protein sequence:

>DPOGS208117-PA
MASRLVAWTSRIFRRGGSPGAVSSLSQSGYRLPHHARVSHRLTAVRASTRKSIIAHAPLGTRIDGFKYRNGMSNGAGSAPWPVHRNVLSRFPPKPTYPSEKRSLQAGEVSAQEANNMSATSSRLQTEAFSAEKKAMASSQARQTFTSSGIFSHKEHSSVAHSNMTISSKNLSTKSTLLSSQMSQLLNGTVKPGDEDLSNLTFEDLDKLDAKSNQKDVDLAIQKYSHRMNAFITAIKNNQIDMKNACVHFMKLNEMVKRAWAVPTYGHELGYSLCNTLRSSGGLDILMANCLESNNPDLQFCSAKLLEQCLTTENRDHVVQNGLEKVVNVACVCTKHSNSVDHSRIGTGILEHLFKHSEGTCSDVIKLGGLDAVLFECRKNDVETLRHCATALANLSLYGGAENQEAMIKRKVPMWLFPLAFHNDDNIKYYACLAIAVLVANKEIEAAVLKSGTLDLVEPFVTSHNPSEFARSNLAHAHGQSKNWLQRLVPVLSSKREEARNLAAFHFCMEAGIKKQQGNTEIFREIGAIESLKKVASCPNAVASKYAAQALRLIGEEVPHKLSQQVPLWSIEDVREWVKQIGFSEYANNFYESRVDGDLLLQITEANLKEDIGLNNGIKRKRFTRELQQLKKMADYSSRDTGSLNEFLQSIGPEYTIYTYSMLNAGVDKESIRGLSDEQLENECRIGNSIHRLRILNAIRAYESTLPSKGEENMEKNLDVFVSYRRSNGSQLASLLKVHLQLRGFTVFIDVERLEAGKFDNNLLQSIRQAKHFLLVLTPNALERCKHDNEQKDWVHREIVAALQSQCNIVPIIDNFQWPEPEELPEDMRAVCHFNGVRWIHDYQDACVEKLESFLRGKSNLATRLEGPLRGRDVPTPGTAAMRPPNYQRMVSTESRGSDKD-