Monarch geneset OGS2.0

DPOGS202900
TranscriptDPOGS202900-TA1791 bp
ProteinDPOGS202900-PA596 aa
Genomic positionDPSCF300126 - 84257-86246
RNAseq coverage2x (Rank: top 91%)
Annotation
HeliconiusHMEL0159131e-11043.63% 
BombyxBGIBMGA006467-TA4e-3428.51% 
DrosophilaCG7896-PA1e-3529.16% 
EBI UniRef50UniRef50_E9CJE94e-4229.60%Tyrosine-protein kinase Src42A n=23 Tax=Opisthokonta RepID=E9CJE9_CAPO3
NCBI RefSeqXP_394034.34e-3528.79%PREDICTED: similar to CG8561-PA [Apis mellifera]
NCBI nr blastpgi|3201662081e-4129.60%tyrosine-protein kinase Src42A [Capsaspora owczarzaki ATCC 30864]
NCBI nr blastxgi|3287043208e-4831.08%PREDICTED: leucine-rich repeat-containing protein 15-like [Acyrthosiphon pisum]
Group
KEGG pathwaymdo:1000307812e-31 
 K04309 (LGR4, GPR48)maps-> Neuroactive ligand-receptor interaction
Orthology groupMCL21017 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202900-TA
ATGATGTTTCTGCACTTTATGATAATACTGCTTGCTGCTTTGCCATGTATTTGTGGAGAATGCCATTTCATACCTTTGCCTAAATTAGAAAAGTGCCGTGCTGCACTGAACTGTTCCATTACTTTGACTGACCCCTCAAGTGATCTAGATTCATGGACGTGCATACGAAGGCAAGAGATCAACTACAATTATTACTACCGAAATAACTATGACAACAGTTATACTCTTGATACTAATTACGAGTTATACATTACGTTACAAGGAGCAGATATAAATACATCCGAGGATCCACTATATGCACTTAATAGTATAAGGACCAGTAATGTAAAGACTTTGAGTATAATGGAAGGAAACTTAGTTTCAGTGTCACCAGACATTTACAAACTGAGTATGTTAGAGGAACTCGTTATCGTTAACAATATGGTTGAAAAACTAAACCTCGCCAAATTGGTCTTCATGTCTTCTCTGACTACACTCAATATGTCTAGAAATTTAATTTCAACAATCGAAAGCGTTGACTTTCAAATAGAAGCGAGCGGCGGGATTCATACTGTTGACTTGAGTTATAATCAACTGGAGACAATACCAGATAATTGTTTTTTAAAATTCCCCCGACTACTTTATCTCGATCTGTCTCATAATTCGATAAAGAAAATCGAGTTGCTCACGTTTGAAGGTATGAGAAGCATCGAAACACTGTTGCTCTCTTACAATGAAATAAAAAATATGGGCTTAAATTTTGTAAGGTTCATAAATTTGAAAAAACTGGAACTGGATCACAACGAACTGACGTCGTTAAGTGAAGAAAGCGTGAAGAATTTAATAAACTTGGAACACTTGAATCTCAGCTCGAATCAGTTAAGAACAATACAAGAAAATAGTTTTAGAGGGCTACAAAAATTACAGGAAATAGATTTGTCAAATAATCAAATCAAGACTGTTCAAGCACATCTGTTTCAAAGTAATATCAATCTGCACACAGTTTACTTTTCGAACAATTACATTGAAAATATACAGGATGGTGCTTTTGATGGAACTAATATAACAGAATTATCCATAAAAGGTAATTGCATCGTTGGAACTATAAATTCTAATACGTTTCTTGGAGTTCATGTCGACAGTATAGATTTGTCTGGCGGAAAGATAACGACTATTGGTGATGAAGCATTCAGTAATGTGGGAGGCGAATTGACTAGTTTGAATTTAAGCAGAAATAGCATAGAAATTATGGCAAAGTCGTGCTTCAAAAATTTGACCGCATTGTCCAAGTTGGATCTATCCAATAATGATTTAGTTGAAATAGATTTTGATAGTAGAGATTTAAAGATGTTGGAAGAGTTGTATTTGAAACAGAATAAAATTAAAAAAGTTCATAACATCGTATTTAGGGACCTAGAATCATTATTAACTTTAGACCTCTCTGAAAATGCTGTTCAGGAATTACAAGAAAACTATTTTGAAGGACTCAAAAACTTGGAAACACTTCTTCTTAATAATAACGAATTACATTTCTTAGCACCAAATGTGTTCAAAGGTTTGGAGAACCTCAATAAGTTAGACATGTCTCAAACAAGAATTATGTCCATCAATAATTTACTATTTGAAGGTTTAGTTTCCTTGGAAATATTAAACATTTCTCGAAGCCAGCTAAAAACAATCGAATTCGGAGCTTTTGATGGAACGAGTTGGATTAAAATTCTCGATGTGTCCTATAATGAATTAGAAAATTTTACTATATGGAAATACCGGGAGAGGTTTTTAATAACTTGCATATCTCAGAAGTATCCTTAG

Protein sequence:

>DPOGS202900-PA
MMFLHFMIILLAALPCICGECHFIPLPKLEKCRAALNCSITLTDPSSDLDSWTCIRRQEINYNYYYRNNYDNSYTLDTNYELYITLQGADINTSEDPLYALNSIRTSNVKTLSIMEGNLVSVSPDIYKLSMLEELVIVNNMVEKLNLAKLVFMSSLTTLNMSRNLISTIESVDFQIEASGGIHTVDLSYNQLETIPDNCFLKFPRLLYLDLSHNSIKKIELLTFEGMRSIETLLLSYNEIKNMGLNFVRFINLKKLELDHNELTSLSEESVKNLINLEHLNLSSNQLRTIQENSFRGLQKLQEIDLSNNQIKTVQAHLFQSNINLHTVYFSNNYIENIQDGAFDGTNITELSIKGNCIVGTINSNTFLGVHVDSIDLSGGKITTIGDEAFSNVGGELTSLNLSRNSIEIMAKSCFKNLTALSKLDLSNNDLVEIDFDSRDLKMLEELYLKQNKIKKVHNIVFRDLESLLTLDLSENAVQELQENYFEGLKNLETLLLNNNELHFLAPNVFKGLENLNKLDMSQTRIMSINNLLFEGLVSLEILNISRSQLKTIEFGAFDGTSWIKILDVSYNELENFTIWKYRERFLITCISQKYP-