Monarch geneset OGS2.0

DPOGS203417
TranscriptDPOGS203417-TA1257 bp
ProteinDPOGS203417-PA418 aa
Genomic positionDPSCF300003 + 1783285-1784690
RNAseq coverage104x (Rank: top 60%)
Annotation
HeliconiusHMEL0036300.079.00% 
BombyxBGIBMGA010957-TA0.086.63% 
DrosophilaZyx102EF-PC9e-8159.53% 
EBI UniRef50UniRef50_B0W5S33e-8751.38%Lipoma preferred partner/lpp n=3 Tax=Culicidae RepID=B0W5S3_CULQU
NCBI RefSeqXP_967989.17e-11046.29%PREDICTED: similar to lipoma preferred partner/lpp [Tribolium castaneum]
NCBI nr blastpgi|910852891e-10846.29%PREDICTED: similar to lipoma preferred partner/lpp [Tribolium castaneum]
NCBI nr blastxgi|910852892e-11346.29%PREDICTED: similar to lipoma preferred partner/lpp [Tribolium castaneum]
Group
Gene OntologyGO:00082704e-14zinc ion binding
KEGG pathwaymdo:1000180843e-68 
 K12792 (TRIP6)maps-> NOD-like receptor signaling pathway
InterPro domain[343-405] IPR0017814e-14Zinc finger, LIM-type
Orthology groupMCL25797 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203417-TA
ATGAATGCTTTAGATAAACATCTAGCAGATTTACGAATATATGCTAATGAATTGGACGCAAAATTACAAGATGACATTAATGCATCCAAAGTAAAACCATCGATCCCTCCAAAAAAAAATAAAGCTCCTTTGCCCCAAATACCCCAAAGTTACACAGTGAAATCCGCCTGCGAGCCGTCATATTCTTCAAGATTAGAAAACGCTAGTCGGACCAGTTCTACATACTCAAATTTGGTACCTGTAAAGAGTTATATTGTAAGAAATTCGGCTCGTGTCCGAAGCGGTGATGTGCTCTTATACAGCAATTTACTGCCACCAGGAGGGACAAATCACGTTTACTCCAATATTCAATCCTCGCGGAACGTTGAAGGCTTTTACGATCGAGATACTAGGTCCGAATTGTCTAGAAATAGCGACTTAGGAGCGCAATCAACTTATAGCGAACTACGAAGACCTGGTTCAGTATACCCACCATCTTCGGCTTCTGTGAACGCTCAAGATTTTTCTACATATGGAGGCTCTCAGACATCTTCAACATATGAATCTCTCTATGAACCTATCAACCCACGCCCATGTAGCCAATTGTCGGGAACTTCTTTATATGGTGGATATGTTGGCACTGCTGTGGAGCCAATACCAGAACCAGACGATGCTCTTTATTGTGGCAATTGCTACAGATGTGGTGAGAAAATTATGGGAGAGACTACCGGTTGTACTGCTATGGAAAGAATATATCACATTAAATGCTTCTGCTGTCATCAGTGTGGTATCAATTTGCAGGGGCGACCATTCTATGCAGTGCAAGGGAAAGCCCTTTGCGAGGTGGATTATTTAGAAACACTTGAAAAATGTTGTGTCTGTAATGATCCAATCCTTGACCGAATACTGAGAGCCACCGGCAAACCATACCACCCGCGCTGCTTCACATGTGTAATGTGCCAGAAGAGTCTAGACGGCATCCCTTTTACCGTGGATGCAGTGAATCGTATACATTGTATTGAAGACTTTCATAAACGTTATGCGCCCCGTTGCGCTCAATGTCGGGAACCAATTATTCCTGAGGGTGGAGCCGAAAAAACTGTCCGAATTGTTGCATTGGATAAGAGTTTTCATATTGCCTGTTATGCTTGCGAAGATTGTGGAGCCTCGTTGTGCTCTAGAGACGAAGGCAGTAGATGTTATCCGCTAGACGATCACTTGTATTGTAAACAATGCAATGCTAGACGTATTCAGGATCTATCAAGAAATATCAATTAA

Protein sequence:

>DPOGS203417-PA
MNALDKHLADLRIYANELDAKLQDDINASKVKPSIPPKKNKAPLPQIPQSYTVKSACEPSYSSRLENASRTSSTYSNLVPVKSYIVRNSARVRSGDVLLYSNLLPPGGTNHVYSNIQSSRNVEGFYDRDTRSELSRNSDLGAQSTYSELRRPGSVYPPSSASVNAQDFSTYGGSQTSSTYESLYEPINPRPCSQLSGTSLYGGYVGTAVEPIPEPDDALYCGNCYRCGEKIMGETTGCTAMERIYHIKCFCCHQCGINLQGRPFYAVQGKALCEVDYLETLEKCCVCNDPILDRILRATGKPYHPRCFTCVMCQKSLDGIPFTVDAVNRIHCIEDFHKRYAPRCAQCREPIIPEGGAEKTVRIVALDKSFHIACYACEDCGASLCSRDEGSRCYPLDDHLYCKQCNARRIQDLSRNIN-