Monarch geneset OGS2.0

DPOGS211161
Transcript: DPOGS211161-TA, 2145 bp
Protein: DPOGS211161-PA, 714 aa
Genomic position: DPSCF300007 + 136588-139899
RNAseq coverage: 301x (Rank: top 37%)

Annotation
Heliconius: HMEL017205, 0.0, 72.52%
Bombyx: BGIBMGA003146-TA, 0.0, 77.85%
Drosophila: jub-PB, 1e-137, 64.83%
EBI UniRef50: UniRef50_B4JWX5, 3e-136, 67.59%, GH17840 n=4 Tax=Coelomata RepID=B4JWX5_DROGR
NCBI RefSeq: XP_002071725.1, 3e-140, 69.86%, GK10130 [Drosophila willistoni]
NCBI nr blastp: gi|195448589, 7e-139, 69.86%, GK10130 [Drosophila willistoni]
NCBI nr blastx: gi|194769416, 8e-154, 45.75%, GF19089 [Drosophila ananassae]
Group
Gene Ontology: GO:0008270, 2.7e-17, zinc ion binding
KEGG pathway: mdo:100018084, 2e-57
    K12792 (TRIP6) maps-> NOD-like receptor signaling pathway
InterPro domain: [559-624] IPR001781, 2.7e-17, Zinc finger, LIM-type
Orthology group: MCL14395, Single-copy universal gene

Nucleotide sequence:

>DPOGS211161-TA
ATGGACTCTTCAAGAAGAAATAATTCTGAGGAGCAAAGGTGTATACTGGGTTTGCAGGATTTGAAATTATCTAACAAAGACGAAAATGATACTGAGCAAGCGCAAAGTACTTCAACATCTTACATTACTGGTCAATCAAAGTCTCGACCCGTGGGTGCGATAGAGAGCTTTACTTTAGATAATTCGGCAAATCGTCAGGATTATAGTTTCTACGAACGGTCCAATGTTATCGCGGCTTCAAAATTTGCAACCCCTAAGCGGGTAGAAACTATCGCGTCGATCAATAAACGGGATATAAACACTGCTGCGCTTCCAGCTAGTCCTACAAGATCTATAGATTCCGTATCTCAGAGGTCATTGAGCTACCAAAGTGAAGATTTATACGAGATCCCAGGCAATACGAGTGCTCAGTACCCTCAACCAGTGTATGAAAATATAGATTATTATACGGAACAACAGTTGTCCCAGCCGCCCTATTTTCACCAACTGTATCAGAGCCAGCCTCTTGCAGCCGAAGGGTACTACGACACACGGTACAGTAAAGCGCAGCCTCAAGTGCCAGTAAATAGTAAACCAAAGCCATTGGTATATGAAAATATGCTTTGTCCTGATAAAGATAAAAAACTCGATTATAAAATGCAAGGTAATGTAGAACAACCCTACATACAACAAGGCTCTGTTCCTGGCCCCCAGGTCCCAAATACTAGCATGAATAGGATGACATCCCAAAGGATTCTTAGGGATAAGCCTAAAGTTGAAAAGGCTCCACCTCCACCACCATATAACAGTTCAAATGATAACTCCAGCAGCAGTGACTACACAATTATGAAGTCTGCACCTCCTAAGTATATGGATGTCAGTGCTCTGATAAAAAATAGTAACTCTGTAAGTTTTGACAATAAAGCTGTGACAATTCCTTCCACTGCTATTTATGCTTCAATCGCTCCAAGTGCTCAAAGACCCAAGGCAAATATTGCTACAGAAGTGCAAGTGGCAGCGGCCGCAACTGCCCCAAAATATTTTCACTCAAAGCCACCTATTGGTAATGGTCTAGGAAAAGTTAAAGTGACAGCCATCAATGGAAGCTATGGTACAACTCCAAAACTACTGCCTAACAGGATGCAGATTGTGGAAGACACTAATGGTTCTGATTATGTTTGTATGACTGGAGGTTCGCCAGCTACAACTACTGCCTCTGCTAATAAGGACAATGTCTCCTTAGCATCAGAATCAATTTCATCTAGAAACTTAGTATCAGGGTTAACATCCTTGTGTTCACCAGCTCCATCCCCCGCACCAAGTCCTACCCCTAGTTCAGTATCTCAAGTTTCTGGTGGATCTCGAGGCTCTGGTGGCAGAGGGAAAAGCTTACTACCTTATAGTATTACCCCACCTCGCCCGCCAGGACCCAGTGAAGCTCAACGTAAAATTGAAGAGCTGACACGGCAACTTGAAGAAGAAATGGAAAGGCAAGATGAGGAAGGGGAATATTTCGGTATCTGCCACACTTGTGGGGCTGGTGTGACGGGCGCCGGCCAGGCTTGCCAGGCTATGGGCAATCTATACCACACCAACTGTTTCATATGCTGTTCTTGTGGAAGAGCCTTACGTGGCAAGGCCTTTTACAATGTACATGGCAAAGTGTACTGCGAAGAGGATTATCTGTACTCAGGATTCCAGCAAACGGCTGAGAAATGTGCAATATGTGGACATTTAATAATGGAAATGATCCTTCAAGCGATGGGCAAGTCCTACCACCCTGGATGTTTCCGTTGTTGTATCTGTAACGAGTGTTTGGATGGAGTACCTTTCACCGTTGACGTTGACAACAAGATTTACTGTGTGAACGATTACCATCGAATGTTTGCACCTAAATGTGCCAGCTGTGGGAAAGGGATAACTCCAGTGGAAGGCACTGACGAAACAGTACGAGTGGTGTCTATGGACCGTGACTTCCATGTTGATTGCTACATGTGTTGTGTTTGTGGCATGCAGCTCAC
AGACGAGCCGGACAAGAGATGCTATCCATTAGCTGGTCAATTGATGTGTCGAGCCTGTCACCTGTCCACCATAGGAGCGTCAACAGGCCCGGGCATGCCTCCAGCACCACATCTCTCACCAGCCTCATACCAATACATGGGATAA

Protein sequence:

>DPOGS211161-PA
MDSSRRNNSEEQRCILGLQDLKLSNKDENDTEQAQSTSTSYITGQSKSRPVGAIESFTLDNSANRQDYSFYERSNVIAASKFATPKRVETIASINKRDINTAALPASPTRSIDSVSQRSLSYQSEDLYEIPGNTSAQYPQPVYENIDYYTEQQLSQPPYFHQLYQSQPLAAEGYYDTRYSKAQPQVPVNSKPKPLVYENMLCPDKDKKLDYKMQGNVEQPYIQQGSVPGPQVPNTSMNRMTSQRILRDKPKVEKAPPPPPYNSSNDNSSSSDYTIMKSAPPKYMDVSALIKNSNSVSFDNKAVTIPSTAIYASIAPSAQRPKANIATEVQVAAAATAPKYFHSKPPIGNGLGKVKVTAINGSYGTTPKLLPNRMQIVEDTNGSDYVCMTGGSPATTTASANKDNVSLASESISSRNLVSGLTSLCSPAPSPAPSPTPSSVSQVSGGSRGSGGRGKSLLPYSITPPRPPGPSEAQRKIEELTRQLEEEMERQDEEGEYFGICHTCGAGVTGAGQACQAMGNLYHTNCFICCSCGRALRGKAFYNVHGKVYCEEDYLYSGFQQTAEKCAICGHLIMEMILQAMGKSYHPGCFRCCICNECLDGVPFTVDVDNKIYCVNDYHRMFAPKCASCGKGITPVEGTDETVRVVSMDRDFHVDCYMCCVCGMQLTDEPDKRCYPLAGQLMCRACHLSTIGASTGPGMPPAPHLSPASYQYMG-
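
The transcript and protein lengths listed in this record (2145 bp and 714 aa) can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes the transcript entry is coding sequence only (no UTRs), that it includes the stop codon, and that the trailing "-" in the protein sequence marks that stop. The function names `read_fasta`, `lengths_consistent`, and `cds_matches_protein` are hypothetical helpers introduced here for illustration.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    record_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            record_id = line[1:].split()[0]
            records[record_id] = ""
        elif record_id is not None:
            records[record_id] += line
    return records


def lengths_consistent(transcript_bp, protein_aa):
    """Assumption: CDS-only transcript, so bp = 3 * (residues + 1 stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


def cds_matches_protein(cds, protein):
    """Same check on actual sequences; a trailing '-' or '*' is treated as the stop."""
    residues = protein.rstrip("-*")
    return lengths_consistent(len(cds), len(residues))


# Lengths as listed in the record header.
print(lengths_consistent(2145, 714))
```

To run the same check on the sequences themselves, paste the two FASTA records into a string, pass it to `read_fasta`, and call `cds_matches_protein` on the `DPOGS211161-TA` and `DPOGS211161-PA` entries. Because `read_fasta` joins wrapped lines, a sequence split across several lines is handled as one record.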