Monarch geneset OGS2.0

DPOGS208954
TranscriptDPOGS208954-TA3378 bp
ProteinDPOGS208954-PA1125 aa
Genomic positionDPSCF300009 + 566220-569597
RNAseq coverage293x (Rank: top 38%)
Annotation
HeliconiusHMEL0225580.095.38% 
BombyxBGIBMGA002428-TA0.090.90% 
DrosophilaHem-PA0.075.76% 
EBI UniRef50UniRef50_UPI00020620950.066.26%UPI0002062095 related cluster n=1 Tax=unknown RepID=UPI0002062095
NCBI RefSeqXP_001602104.10.078.93%PREDICTED: similar to membrane-associated protein gex-3 [Nasonia vitripennis]
NCBI nr blastpgi|3320173670.079.62%Membrane-associated protein Hem [Acromyrmex echinatior]
NCBI nr blastxgi|3320173670.079.70%Membrane-associated protein Hem [Acromyrmex echinatior]
Group
KEGG pathwaynvi:1001180190.0 
 K05750 (NCKAP1, NAP125)maps-> Regulation of actin cytoskeleton
InterPro domain[11-1119] IPR0191370Nck-associated protein 1
Orthology groupMCL10632 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208954-TA
ATGTCTACTTTGTCGAGGTCAGTTCCTATCAGCCAGCAAAAGCTGTCTGAAAAACTGATAATTCTCAATGATCGGGGTATTGGAATGTTGACGCGCATTTACAACATTAAAAAGGCCTGTGGTGACGCTAAATCGAAACCAGCATTCCTGTCAGACAAAACATTGGAATCTTCTATCAAACACATTGTGAGAAGATTTCCCAATATTGATGTCAAAGGATTACAAGCCATAACTAATATAAGAAATGAGATAATTAAGTCCCTCTCCCTATACTATTATACATTTGTAGATCTTTTAGATTTCAAGGACAATGTTTGTGAGTTGCTTAATATTATTGATGGATGTCAGTTAACTCTAGATTTGACACTTAATTTTGAACTGACAAAAAATTACTTGGATCTTGTGACAACTTATGTTGCACTCATGATATTGCTGTCTCGAGTTGAAGACCGAAAAGCTGTGCTTGGATTGTTTAATGCTGCTCATGAAATGGTCCACAATCAGATAGATGCTAGTTTTCCACGTTTGGGACAAATGATAGTCGATTATGATGCTCCATTCAAGAAGCTGTCAGAAGAGTTTGCTCCACATCAAAAAGTTTTATCTAGTGCATTAAATTCATTATGGCATGTATATCCAGCAAGGAATAAGACAGCTGATCACTGGAGGTCTGAACAGAAGCTTAGCTTAGTGAGTAACCCATCACAACTTCTAAAACCATCTGAAACTGCAACTATGTCCTGTGAATACGTCTCATTAGAAGCACTAGAGAGATGGGTGATATTTGGTTTTGCATTGTGTCACCAAATGTTGCAACAAGACCATGCAAACAAAATGTGGGTTAGTGCCTTAGAATCCGGTTGGGTTGTTGCACTATTTCGGGATGAAGTTATTTACATTCATACTTATATTCAAAGCTTTTTTGATGGAATTAAAGGATATGGTAAAAGAATATCTGAAGTAAAGGATTGTTATCATCAATCTGTTCAAAAAGCAGGCTATAAACACAGAGAAAGAAGGAAGTTCTTACGAACTGCTTTAAAAGAATTGGGTCTCATTTTAACTGATCAACCTGGTTTGTTGGGACCCAAAGCACTGTTAATATTTATTGGTCTTTGCTATTCAAGAGATGAAGTTTTTTGGCTACTTCGACATAATGACAACCCACCACAAAAAGTTAAAGGTAAATCTACAGAAGATTTGGTAGATCGTCAATTACCTGAACTTCTGTTTCACATGGAGGAACTCAGAGCACTTGTAAGGAAATATAGCCAAGTTATGCAAAGATATTACGTACAATATCTATCAGGTTTTGATGCAGTAGCCTTGAATCTCATGATTCAGAATTTGCAGGTGTGTCCTGAGGATGAAAGTGTTATATTGTCATCTCTCTGTAGTACAGCTGCCAATTTAAGTGTTAAGCAAGTTGAGAGTAATGATCTATTTGATTTCCGTGCTTTTCGTCTTGATTGGTTTAGATTGCAGGCTTACACATCTGTTGCAAAATGTCAATTTAACCTTGTCGATCAGAGAGAACTTGCACAATTTATTGATAAAATGGTTTTTCATACGAAAATGGTTGATAATCTCGACGAGATAATGGTGGAAACTTCTGATCTGTCATTATTTTGTTTCTACAACAAAATATTTGAAAGCCAGTTTCATATGTGCCTGGAGTTTCCGGCCCAAAACCGTTACATTATTGCATTCCCACTTATTTGCAGTCATTTCCAGAACTGCACTCACGAATTGTGTCCGGAAGAAAGACATCACATAAGAGAAAGGAGTCTGTCGGTTGTTAATATATTCCTGGACGAAATGGCAAAAGAAGCAAAAAACATAATTACTACCATTTGTGATCACCAATGCACAATGAGCGACAAATTGTTGCCGAAACACTGTGCTCAAACTATAGCACATTTAGCGAACAGAAAGAAAAAGGACAAAGGTAAAAAGAATCCGATAGAGATTATAAAACCTGGAGCTGAGAGCTACAGAAAAACTCGAGAAGAATTAACAACAATGGATAAACTTCATATGGCGTTAACTGAATTGTGCTTTGCGATAAACTATTGCTCAACGGTTAATGTTTGGGAATACACATTCGCTCCTCGTGAATACTTACATCAGCATTTAGAAAATAGGTTTTCTAAAGCACTAGTAGGGATGGTCATGTTTAATCAGGACACAAGCGAAATCGCCAAACCTTCGGAACTATTGGTCAGTGTAAGAGCATACATGAATGTACTTCAAACCGTCGAAAATTATGTGCATATCGACATCACAAGAGTGTTCAACAATTGTCTCCTTCAACAGACACAAAGTTGTGATAGTCACGGTGAAAAAACTATTGCAGCCCTTTATACTCAGTGGTATTCCGAGATACTACTCAGGAGGGTTAGCGCGGGCAACATTTGCTTCTCAATGAGTCAGAAAGCTTTCGTGAGTTTATCTGCTGAAGGTGCAATCCCTTTCAATGCAGAGGAATATTCAGATATCAATGAACTAAGAGCACTAGCGGAGCTAATAGGACCTTATGGAATGAAACATCTCAGCGAGACTCTAATGTGGCACATCGCTAGTCAGGTTCAGGAACTTAAAAAGCTAGTAGTACAGAATAAAGAGGTCCTCCAAATGCTCCGGACAAATTTCGACAAACCTGATATAATGAGAGAACAATTCAAAAGGTTGCAACACGTTGACAATGTGTTACAAAGAATGAGCATTATCGGTGTAATTTTGAGTTTTCGCCAAATCGCACAAGAGTCCCTGCTCGATGTGTTGGAGCGTCGCATTCCGTTTTTGATAAGCTCCATCAAAGACTTCCAACAACAGTTACCCTCCGGCGAACCTTCGCGTGTCATTTCTGAGATGTGTTCAGCCGCTGGACTGTCGTGTAAAGTGGATCCTACATTAGCTTCCTCCTTGCGACAACATAAAGCGGAACTTGAAGAAGAGGAGCATCTCATAGTTTGTCTGCTGATGGTGTTTGTAGCCGTCTCTCTGCCGAGGCTCGCTCGTAACGAAGGTTCGTTTTACAGACCATCTCTAGAAGGGCATGCTAACAATATTCATTGTATGGCACCAGCAATCAATCACATATTTGGGGCTCTGTTCACTATCTGTGGTGAAGGTGATATTGAAGATAGAATGAAAGAGTTCTTAGCACTAGCTTCGTCGTCGTTGTTACGACTTGGACAAGAGACGGATAAAGAAGCTATAAGAAATCGTGAATCGGTTTATTTGCTACTCGATCTAATTGTTCAGGAATCACCATTCCTTACAATGGATTTACTGGAATCCTGCTTCCCGTACGTGCTCATAAGGAACGCTTATCACGAAGTATACAAACAGGAACAGATGCTTCTGCATTCATAA

Protein sequence:

>DPOGS208954-PA
MSTLSRSVPISQQKLSEKLIILNDRGIGMLTRIYNIKKACGDAKSKPAFLSDKTLESSIKHIVRRFPNIDVKGLQAITNIRNEIIKSLSLYYYTFVDLLDFKDNVCELLNIIDGCQLTLDLTLNFELTKNYLDLVTTYVALMILLSRVEDRKAVLGLFNAAHEMVHNQIDASFPRLGQMIVDYDAPFKKLSEEFAPHQKVLSSALNSLWHVYPARNKTADHWRSEQKLSLVSNPSQLLKPSETATMSCEYVSLEALERWVIFGFALCHQMLQQDHANKMWVSALESGWVVALFRDEVIYIHTYIQSFFDGIKGYGKRISEVKDCYHQSVQKAGYKHRERRKFLRTALKELGLILTDQPGLLGPKALLIFIGLCYSRDEVFWLLRHNDNPPQKVKGKSTEDLVDRQLPELLFHMEELRALVRKYSQVMQRYYVQYLSGFDAVALNLMIQNLQVCPEDESVILSSLCSTAANLSVKQVESNDLFDFRAFRLDWFRLQAYTSVAKCQFNLVDQRELAQFIDKMVFHTKMVDNLDEIMVETSDLSLFCFYNKIFESQFHMCLEFPAQNRYIIAFPLICSHFQNCTHELCPEERHHIRERSLSVVNIFLDEMAKEAKNIITTICDHQCTMSDKLLPKHCAQTIAHLANRKKKDKGKKNPIEIIKPGAESYRKTREELTTMDKLHMALTELCFAINYCSTVNVWEYTFAPREYLHQHLENRFSKALVGMVMFNQDTSEIAKPSELLVSVRAYMNVLQTVENYVHIDITRVFNNCLLQQTQSCDSHGEKTIAALYTQWYSEILLRRVSAGNICFSMSQKAFVSLSAEGAIPFNAEEYSDINELRALAELIGPYGMKHLSETLMWHIASQVQELKKLVVQNKEVLQMLRTNFDKPDIMREQFKRLQHVDNVLQRMSIIGVILSFRQIAQESLLDVLERRIPFLISSIKDFQQQLPSGEPSRVISEMCSAAGLSCKVDPTLASSLRQHKAELEEEEHLIVCLLMVFVAVSLPRLARNEGSFYRPSLEGHANNIHCMAPAINHIFGALFTICGEGDIEDRMKEFLALASSSLLRLGQETDKEAIRNRESVYLLLDLIVQESPFLTMDLLESCFPYVLIRNAYHEVYKQEQMLLHS-