Monarch geneset OGS2.0

DPOGS202447
TranscriptDPOGS202447-TA5388 bp
ProteinDPOGS202447-PA1795 aa
Genomic positionDPSCF300174 - 249562-266877
RNAseq coverage3426x (Rank: top 4%)
Annotation
HeliconiusHMEL0156470.046.16% 
BombyxBGIBMGA009970-TA0.050.40% 
DrosophilaZasp52-PF9e-9377.40% 
EBI UniRef50UniRef50_UPI0002063F433e-11853.11%UPI0002063F43 related cluster n=1 Tax=unknown RepID=UPI0002063F43
NCBI RefSeqXP_001975468.17e-10544.69%GG22334 [Drosophila erecta]
NCBI nr blastpgi|3504082952e-12049.26%PREDICTED: hypothetical protein LOC100744292 [Bombus impatiens]
NCBI nr blastxgi|3504082958e-12446.21%PREDICTED: hypothetical protein LOC100744292 [Bombus impatiens]
Group
Gene OntologyGO:00055154.1e-26protein binding
GO:00082702.9e-18zinc ion binding
KEGG pathwayisc:IscW_ISCW0098998e-35 
 K05760 (PXN)maps-> Chemokine signaling pathway
    Regulation of actin cytoskeleton
    Leukocyte transendothelial migration
    Bacterial invasion of epithelial cells
    Focal adhesion
    VEGF signaling pathway
InterPro domain[1-101] IPR0014784.1e-26PDZ/DHR/GLGF
[1740-1793] IPR0017812.9e-18Zinc finger, LIM-type
[153-178] IPR0066431.4e-09ZASP
Orthology groupMCL25674 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202447-TA
ATGGCACAGTTGATAACTGTGCGACTGAACAAGTCCGATCAGCAGCCTCTTGGCTTCAGGCTGCAGGGCGGCAAGGATTTCGGCACTCCGCTGGTTGTACAGAAGGTGAACGGTGGGAGTGCGGCTGAGCGGGCGGGTTTGCAGGCTGGGGATGCGCTCATTCGAGTCAACAATACTGACGTGTACTCCCTGAGACATCAAGAGGCACAGGACGCTATACGCGCTGCTGGCGGGAATCTGGAACTGACTGTGCAAAGAGGTGGTGGTACGTGGCGTCCTACCGTCACTCCTACTGGAAGCCTCCCTCGCCCGGGATCTCGTCCACTGGGTGCCGCCCCCGCTCCAGTCACCAGCACCTCTCTGAAGGCGACCCCTCAACCTTCGAGGGCCTTCGGTTCTGGTCACAACAACGTCGCCAAGCCGTTTGGATATATGAATGGCAACGATTCAGTGAAGAGCATTGTCAACAAACAGTACAACACACCTGTTAGTATGTACAGCGACAAAACTATCGCTGAGACACTCTCCGCCCAGACCGAGGTTCTTGCGGGCGGTGTTTTGGGAGTGAACTTCAAGAAGAACGAAAAAACTTACGACGCTGAAAAAAGTGCTGTATTCAAGGTGTTGCAAGAGGCTGAAAACGATCCTGAGCCAGTATCTGAGGCGAGCCCGGGGGCGACGACCCCTGTGAGTGGTCTCCGTCACGTGTCCGCGCCCGTGGCCCGCGACACGCCCGTCAACACCGGGGGCCTGCCCACGGGACAGAACATCTGTGAGGACTGCGAGCGACTTATCACGTCAGCAGAGGCGCCTCGGTTCCTGCCGTCTTCGAGGCTGGCTCACTTGGCGCCCGAGGCCCCCCACCGGCCCAGCATCCCACTGGGCTGTTCTCGCGTGTTGTCGGACGGCCGCGTGGCGCTGGGCCCCCCCCAGCCGCCCCACGGTCCCCTCAACGCCCCCACCGAGGCCCCGCACTGCTCCGAATGTAACGGCCACATCGTGGGTGTGTTCGTACGTATCAAGGACAAGAATTTGCACGTGGAGTGCTTTAAGTGCGCCACGTGCGGTTCCTCGCTGAAGAACCAGGGTTACTACAACCTGAACGGGAAGCTGTACTGCGACATCCACGCCAAGCTGGTTGCGAGACAGAACCCGCCCGCACCGAACTTGGAACCCGTCACTGTAGCTCCCGGTGGCCGCGTGCCGACGAACGCTTACTCGACTCCGCTGCCACCGCTGTCCACCAACAACTACACCAACGGATCATCATCGATGTTTAGCCCATCTAGTAATCTGTCTGGTCCGAAGCCGTTCGGTTCGTCCCTGGGCACGTATTCTCCGTCGTCGTTGTCTCCTCGCTCGGCGCCGCTCTCTCCCCGGACACCAAACTCTGCACCTGCACCTGCACCTGCACCGGCCCCTGCACCACAACACGCGTTCGCACCAACCAAAAACGTCAAAAGCATTGTGTGGCCCCCTCCTAATCCCTCAGAAGATGAACCCGAAAGTGAACTTAATGTAAATTGTAATCAAACCTCATTACACAGTGATTTCACTTCTTTGTCTGAAGAAACAATAACACAAAATACCAACAATAGAAAAATAACTGCTAATGAAATAAAGACTACGGATTCATCTATAACTCCGTTGTTGCAAACAATGGAGTTTTCTTCAAACGCCTTATTTGATACGACGCAGATGGCGTCGTCTACCATAAAATCACAAACCGCTGTTCAGAAGCAACGATCTGAGATTAAACAAGAATTTTCAAGTTGCACCGTTCAATCGTACTCACAGCAAGCTTCTGATTGCAAACAATTTTCTGTCTATCAAAATGTTGTGGCACCAGAAGAAATGAATAAGAGAAATAACACCATGGAAAAATCACGACTTGAACATAGTTCAAATGTCGTGCAATCTAAATTTAAGACGGGAGAGAGTGTGTCTCAAAACTTTCCTCCTCTCAATGCAAATACTTCAAACACAAAAATGCAGCCATGTTGGAATAAAGTAAGCGATAATAAAATACAAACTAAAGAAAATCCAATGTCATCAAAAGTGACACAATCAGGTGTTAAGGCACCCGAATCTAAGAAGTCTATAAGTGGAAAACCCCTGGCTGGTACAGCAAAAACCAATACTTTGCAAAGAGGATCCCTGTTAGAAGCCCTTACTATAGCTCCAGATCGACCATACAGTCCATTATCGTTTCATATACCAACAGTGCATTACAATAAGTTTCAATCCGGAGAATACCAGCCTTTGTCATCTGAAGCTGTGCCATCACTATCGTTTTCGGAATGTAGTCAATCTAATCAATCAAATCAAACTTCAGAACAAAGTGCAATACAAAATTCTGGAATTTTGCAATCAACAGCAGCAACTTCAGCATTCAAACCGGTGTGCAAACAAACGTTTCCACCTCCAAAACCACAAGAATTATCGTCACAACCTTTTTCTAATTTAAGTAACCAAAGCAGTGGTTATAAAGATTTCTCTCAATCAAAACAGGAAACAGTACAGGAGTTTAAACTGTCACAAAATAAAATGCAACAACATGAATCCTCTATTTCTACAACTGCTTTACCCAGATCAGAGGGTATTCCACAATATCAACAAGTTCTAGTACCAGAAGTATATAACCCAGAGCAGCAAAAGTCTTTCACGCCATGCAACCCAACAAATAAAGTCGCTAAAACGGACAGACCGATGACTTTCCAACCAGTTGTAGATGAAACATCGCTTAGAATATCACCTGCTCGGAGCCGTCCAACAACACCAGGTATGATCAATAAGCCGGCTCCCATAATACCTCATTATCAAATGAATTTAGTTTCTATTGAGCATCAAGCTCCAGAAAGCCGTTTATACAAACCTGGTAGTGCTGAAGTCAGTCGTTCTTCAACTCCCATAGCACGATGTCATTCTCCGGCACCTGGGCCTCCAGCTAATCCTCTAAAGGCACAAGCTCCCAGAATAAAACAAACATCTCAATCTACCTTTGAACAAGCACAATCTTGCAATAAGCAATTGTCAAACATCCGAAAAGAACACGAGATGTCAGGACCAGAGTTACAATCTCAGATGTCATCGTTTAGTCCTTCAGATTTAAAAACTTATAACCCGAATCAACCCACGGTTATAAAAGACGAAAGACAAGCAAATGTAGAATATCGCACGGAAAATTATAGTCAAGGTGATCTGAATATAAAAGAAGACTCTATGTTAAATCAGAACTACGGCGAAAAACAAACAGAATTTCAAAATATAAGTGAATATGGCGACACTACCGTTCAGACTACAAAAAAAACCTTCGAAGAATACGAAAGCTCACAATCCGCTAAGGTTATAGAAATTCACAAGGGCGATTCACAAACTTACTGTTTAAATCAGCCGATTGATTCTAATACGCAACCATATAATTCTAACTCCAAGCAGGTGTTCCCCCCTCCTCTTTCAACCATGACTCCAACCCAACAAAACTTACTTCGTACTAATGACAGTGTTGTCAATGCTAGTAAACGATCAGAAATGTATCAACCAACTCCTTTTATATCTGGTGCTAACCAAGGTCCAGTGTGTGATCCTACCCCATCAACAGGTTCCAGTGTAGGAGCTGCGGCTCGTGGTAAAGCTTTTGGCGTTTCATCAGCTCCAAAACGTGGCAGGGTTAATGAATGTCAAACAGCGAACAATCTACACCGAACCGATTATAAAAGTCCGGCTGAAGAACGGCTAATACGAAAAATGGCAAAAATGGCCCTCAACGGCTATGAAATTGGAATTCAGAGAAAAATAAAACCAGCCGACCATATACAGTCCATGGCTGACAAAATTCAAGATACGGTGGTTACAAAACATCCTCTCAACACCGACACCATACCTTCTGCTGATACTAGTCGTGATAATACTTTGAAACGATCAGTAAACAAGAAGTTTGAAAATAATTTGAAATCCATTGATTATACGCCTCTGAAACCAACCAATGGAAGCTACTTACCAAACAGTAACGGGACAAATAACACGTTCAACAATACACATACACCCATCCCTCCACCTCTTCCAACAACTCCGGTGCCAACATTTCAAGTGCACAGCACGCCTATCACAAATATATTGAATAGCATTGTACCACAGAATTCAGAGTACAATAACGCTATTAAGAACAATATCTATGACGAATCCACGTTTAATGAGAAAACTCAAAATGGTGGTTATTCAAGCAATGACGATTCAATTAAAAGCAAATCAGAAAATATTTTTGACAGCATCTCCAAAAAGAAAACGGCTTTCGAAAAAGTTACGGATGATTCGAATTCTTCAAATCATCATGATTTTAAAAATGAGTCTAAATCTGAAACAAATTATTCCAATACATTGAGCAAAAGAAGATTATTCGAGAGAAGCGACGCTTTCAATGAATCCTTGACAAAAAATACGCAGAGGAGTACGGAAAACAGTAAAATGTCACATCATGACAAAGAGATTCTAAATAAAAACAGTTACTCCAAAAATAATTTTACCCAAGAGTCCAAGATAACAAAGGAACAAGACGAGAACGACAAAAAACTACCGAATGATCTAATTGATGATTCTTTGTATAATTTTAAACCAGTGCTTAACGGAGGCGCTTCGTGTAGTAGCAGTAACAGTGGTTACAGTGGAAACAGTAGCGGGAGTCGGAGCAATGGGATCAGTAACAAACAGGAAAAGGATTCAGACGGTTACTGCGAAGAAGTTGTTGTAAAACGCAGACAGAAAAATAATAGAAACGACAACGGCCGTAGAGATTCCCGAATCGTCGCGAGACCATTGAGTACCATGACCTCAGAGGATGTGACCGACGGGCAATACATTTGTCATGTATGTGACAAGGCCATTACCAGGGGTCCGTTTATTACCGCGTTGGGTCGTATTTGGTGTCCTGAGCACTTCGTTTGTGTTAGCGCATCCTGCCGACGTCAGTTGCAAGACATTGGCTTCGTGGAAGAAAATGGCCAACTATACTGCGAGTTCTGCTTCGAGCAATACATCGCCCCTCCTTGCGACAAGTGTCATAACAAGATAAAACAGGACTGCCTGACCGCTATCGGCAAACGCTTCCATCCAGAATGCTTCAACTGTGTATACTGCGGCAAACTGTTCGGGAACAGTCCGTTCTTTGTAGAAGACGGTCTGCCATATTGTGAAGCAGATTGGAACGAGCTGTTTACGACCAAATGTTTCGCTTGCGGTTTCCCCGTGGAGGCCGGCGACAGGTGGGTGGAGGCGCTCAACAATAACTACCACAGTCAGTGCTTCAACTGCACGGTGTGCAAGAAGAACTTGCAAGGGCAGAGCTTCTTCGCCAAGGGAGGTCGACCTTTCTGCAAGTCTCACGCCCGCTAG

Protein sequence:

>DPOGS202447-PA
MAQLITVRLNKSDQQPLGFRLQGGKDFGTPLVVQKVNGGSAAERAGLQAGDALIRVNNTDVYSLRHQEAQDAIRAAGGNLELTVQRGGGTWRPTVTPTGSLPRPGSRPLGAAPAPVTSTSLKATPQPSRAFGSGHNNVAKPFGYMNGNDSVKSIVNKQYNTPVSMYSDKTIAETLSAQTEVLAGGVLGVNFKKNEKTYDAEKSAVFKVLQEAENDPEPVSEASPGATTPVSGLRHVSAPVARDTPVNTGGLPTGQNICEDCERLITSAEAPRFLPSSRLAHLAPEAPHRPSIPLGCSRVLSDGRVALGPPQPPHGPLNAPTEAPHCSECNGHIVGVFVRIKDKNLHVECFKCATCGSSLKNQGYYNLNGKLYCDIHAKLVARQNPPAPNLEPVTVAPGGRVPTNAYSTPLPPLSTNNYTNGSSSMFSPSSNLSGPKPFGSSLGTYSPSSLSPRSAPLSPRTPNSAPAPAPAPAPAPQHAFAPTKNVKSIVWPPPNPSEDEPESELNVNCNQTSLHSDFTSLSEETITQNTNNRKITANEIKTTDSSITPLLQTMEFSSNALFDTTQMASSTIKSQTAVQKQRSEIKQEFSSCTVQSYSQQASDCKQFSVYQNVVAPEEMNKRNNTMEKSRLEHSSNVVQSKFKTGESVSQNFPPLNANTSNTKMQPCWNKVSDNKIQTKENPMSSKVTQSGVKAPESKKSISGKPLAGTAKTNTLQRGSLLEALTIAPDRPYSPLSFHIPTVHYNKFQSGEYQPLSSEAVPSLSFSECSQSNQSNQTSEQSAIQNSGILQSTAATSAFKPVCKQTFPPPKPQELSSQPFSNLSNQSSGYKDFSQSKQETVQEFKLSQNKMQQHESSISTTALPRSEGIPQYQQVLVPEVYNPEQQKSFTPCNPTNKVAKTDRPMTFQPVVDETSLRISPARSRPTTPGMINKPAPIIPHYQMNLVSIEHQAPESRLYKPGSAEVSRSSTPIARCHSPAPGPPANPLKAQAPRIKQTSQSTFEQAQSCNKQLSNIRKEHEMSGPELQSQMSSFSPSDLKTYNPNQPTVIKDERQANVEYRTENYSQGDLNIKEDSMLNQNYGEKQTEFQNISEYGDTTVQTTKKTFEEYESSQSAKVIEIHKGDSQTYCLNQPIDSNTQPYNSNSKQVFPPPLSTMTPTQQNLLRTNDSVVNASKRSEMYQPTPFISGANQGPVCDPTPSTGSSVGAAARGKAFGVSSAPKRGRVNECQTANNLHRTDYKSPAEERLIRKMAKMALNGYEIGIQRKIKPADHIQSMADKIQDTVVTKHPLNTDTIPSADTSRDNTLKRSVNKKFENNLKSIDYTPLKPTNGSYLPNSNGTNNTFNNTHTPIPPPLPTTPVPTFQVHSTPITNILNSIVPQNSEYNNAIKNNIYDESTFNEKTQNGGYSSNDDSIKSKSENIFDSISKKKTAFEKVTDDSNSSNHHDFKNESKSETNYSNTLSKRRLFERSDAFNESLTKNTQRSTENSKMSHHDKEILNKNSYSKNNFTQESKITKEQDENDKKLPNDLIDDSLYNFKPVLNGGASCSSSNSGYSGNSSGSRSNGISNKQEKDSDGYCEEVVVKRRQKNNRNDNGRRDSRIVARPLSTMTSEDVTDGQYICHVCDKAITRGPFITALGRIWCPEHFVCVSASCRRQLQDIGFVEENGQLYCEFCFEQYIAPPCDKCHNKIKQDCLTAIGKRFHPECFNCVYCGKLFGNSPFFVEDGLPYCEADWNELFTTKCFACGFPVEAGDRWVEALNNNYHSQCFNCTVCKKNLQGQSFFAKGGRPFCKSHAR-