Monarch geneset OGS2.0

DPOGS205407
Transcript        DPOGS205407-TA  2484 bp
Protein           DPOGS205407-PA  827 aa
Genomic position  DPSCF300407 + 155055-169401
RNAseq coverage   987x (Rank: top 13%)
Annotation
Heliconius        HMEL005018        0.0     64.71%
Bombyx            BGIBMGA001577-TA  1e-144  61.74%
Drosophila        Src42A-PB         7e-08   31.21%
EBI UniRef50      UniRef50_E2BTV5   9e-19   26.32%  Testis-expressed protein 14 n=1 Tax=Harpegnathos saltator RepID=E2BTV5_HARSA
NCBI RefSeq       XP_625262.2       2e-22   25.75%  PREDICTED: similar to testis expressed sequence 14 isoform b [Apis mellifera]
NCBI nr blastp    gi|328782957      2e-21   25.75%  PREDICTED: hypothetical protein LOC551685 [Apis mellifera]
NCBI nr blastx    gi|328782957      4e-21   24.48%  PREDICTED: hypothetical protein LOC551685 [Apis mellifera]
Group
Gene Ontology     GO:0016772  8.8e-29  transferase activity, transferring phosphorus-containing groups
                  GO:0004672  5.1e-16  protein kinase activity
                  GO:0006468  5.1e-16  protein phosphorylation
                  GO:0005524  1.1e-08  ATP binding
                  GO:0004674  1.1e-08  protein serine/threonine kinase activity
KEGG pathway      bta:506270  9e-10
                  K05725 (PTK2, FAK) maps to:
                      Axon guidance
                      Amoebiasis
                      Regulation of actin cytoskeleton
                      Bacterial invasion of epithelial cells
                      Small cell lung cancer
                      Chemokine signaling pathway
                      Pathways in cancer
                      Leukocyte transendothelial migration
                      Focal adhesion
                      ErbB signaling pathway
                      VEGF signaling pathway
InterPro domain   [290-576]  IPR011009  8.8e-29  Protein kinase-like domain
                  [289-420]  IPR001245  5.1e-16  Serine-threonine/tyrosine-protein kinase
                  [283-569]  IPR002290  1.1e-08  Serine/threonine-protein kinase domain
Orthology group   MCL26598 (Lepidoptera specific)
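
The InterPro coordinates in this record can be sanity-checked against the protein length. The Python sketch below assumes the bracketed ranges are 1-based, inclusive residue positions on DPOGS205407-PA, which the record does not state explicitly. The helper names parse_span and span_length are hypothetical, introduced only for illustration.

```python
import re

PROTEIN_LENGTH = 827  # aa, taken from the Protein field of this record

# InterPro hits as listed in the record: (coordinates, accession)
DOMAINS = [
    ("[290-576]", "IPR011009"),
    ("[289-420]", "IPR001245"),
    ("[283-569]", "IPR002290"),
]

def parse_span(text):
    """Parse a '[start-end]' string into a (start, end) tuple of ints."""
    match = re.fullmatch(r"\[(\d+)-(\d+)\]", text)
    if match is None:
        raise ValueError(f"unrecognised coordinate format: {text!r}")
    return int(match.group(1)), int(match.group(2))

def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

for coords, accession in DOMAINS:
    start, end = parse_span(coords)
    # Every domain must lie inside the annotated protein.
    assert 1 <= start <= end <= PROTEIN_LENGTH
    print(accession, start, end, span_length(start, end))
```

Under that assumption, all three hits fall within the 827 aa protein and overlap one another in the region of roughly residues 283 to 576, consistent with a single kinase-like domain reported by three signatures.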

Nucleotide sequence:

>DPOGS205407-TA
ATGGGTGTTTTCAAGTGGCTACGCCGGGAACGTCTCACAGTTACCACTGGCCGTGTGAGCCCGGCATCCAGCGTCGATGACCCACCGGTTCAAGTGGGCACCATCAGGACGGAGGAGTACAAGAGAGATCTGGCCGAATTCCAAGCGAGACGATCAAATGGCAGCAAGGATGTCCAAGACACACCGAACAAAAAGAAATGCGGTCGACACTCTCTGCCCGTATCTTGCACCAATTGTACCGGTGATCCGGACGCGACTTTCAATTACGAAAAGAAATCGGCTATGAAATATAAGAAGAAATCACAGAAGCGAACAGGAAGCATCCATTCCGAGGAGAAGCTGTACGAAGGTGCCCCCAACATTCACTTTCTATTTCAGAACCAAGTTTTCATGCCGGGGAATATGTTCGCGTCTTATTCCGAGCCTCGCAAGCAACGTAAGCATCGTCAGAGATCGCCAGATTTTCTAGTGCAGAAACAAATAGACTTCAAGGACGACCAACCCTTGACGCCGCCGCTGTTTTATAAGGACAGTTCGTTACCATTTGACACTAAAAACTTTTTAGAGTTATCGAGGAGCGATGACGAATTAGACGGTGTGAAAACTGGTGACGCGAAGAAGTCCAGCGAGGGTTCTGGCAGCCATAAGAACGACAGCTGTGACGTCAAACTGTGGGAGGTTATGAGCGAACTGAAACATTTTGACAAATGGGCCGATGAACAATTACAGGCTCCGTCGACGACCAGCAAATCAGATGACAGTAGGAGCGACACCAGCGCTTTCGGTCTGCCGATAGCCAGCGCCTGCAATCTGGACGTTACAATTGAAAGGAATAGTTCTTGGAGTTTGGGCAAATGGGGCGTCGTGCCAGTTCAGATAAAAAGACTCAATGACGTCACAGTGGAGCATGTCAGGAAGAAATGGAATATGGAGATTAATATATTGAGAAAATTCCGCCATCCAAACATAATCCTCCTCATGGGTGTGTACCCGGATGTCCAGAACAATGTGCACCTGATATGTGAGAGATGCATCGATAGCTTGTATGGAATGCTGCATATGCAGGGTCGTATTCTAAGCATCCAAACATCGATCCATTATACCCTGGATATAACAAATGCCCTGGTCTTTCTGAACATGCAAGGCTACCTTCATACAGCATTGACCAGCAAAAGCATTATGATAACGGCCCAAGATATAGCCAAGATAGCCGACCTCCAGCCCTGTACGAGATTGACGAAGAAGAAAATATATGACAAAGGCTATGACACTTACAAATCTGAACCTCACTATACTAACATTAGCGATAACCGGACGTCATCAGAGAGCGAGCCGCTTCTGACTCAGAGATCGGACACTGATGTTGTGTCAAAGCCCTACGATTATTACTTAGAGATGACGGACTACAACTGGCAGGCCCCGGAGCTGTTTGAACCTGATGAAGGGATGGTGCACCCGTGTAACAAAAGCGATGTCTATTCCCTATGTCTCATACTGTGGGAGTGTTGTAACGCCTCAGTTCCCTGGAAGACACTAAATTATGCCAAGCTTAAAGAGTTGTATACATTATGGAGGACGGGTCTCCAGCTGCCCAGGGACGGTTCATACCCTGGTTGTATACTAGCAGTACTGGAAGCAGGGCTCAAGTTAGACCGGGGACACAGAATCGACTTGGGGGGACTCCAAGTGGCACTGCAAAAGGCAAAGCATGATTTAGACGAAATCGAATACATTATATTGCCGGAGAAAATCGCCAAGAAGAAATCTAGTTTCAGCACGCAACCTTGGGATACGACCTCGCCCACCGACTCCGATTATCAAAGCCCAAAAAGTTTAGAGCATACAGCGGTCGTGCACAAACACACGGACAGTTCGACGGGCAGCGATAATGAAGTTAAATCCAGTCACAATGAGACATTGCGCAAAACGAACCTACTCACATCATATACAACCGAAAAATGTTTCGTATCCGACGAGGAAGACAATGTACACACGTACCA
TTCACAAATCAAAGATTATTCAAGCGCCTGCTCGACGCCGATAGCAAAGCTCAGAGCCAGGATCAGAGACATCGAAACGCCGGGGACACTCCACAGAAGCGATTCCACGGAATACTGCAGCATAGTGAGCCCTGACACGAGATTGACGAATTTCGGACTCAACGACGATACAAATAGGGACGAAATTTCCGAACGATCCACCGCCAAACTAAGCAGAAGCTTCACACCAAAATCATATAAACCGTTGCACATAAAAGTACCTGAATACAATTTGGATTCAATCAAAACTATACTCAAAGATCGCAATGGGAACGAAAAGTCCTCTTACAATTTTGATATAAAGAATTACAGCTTACCGACAACACCCATAGCTAGGAGTAATAAGTTGAGGAAAAACGCCTGGCTGTCGGGCGATGTGGATGACAGACGGCATGCGAGGAACGCCGGGGGTCAGGAGTTGAGTAATAATAACGAAGGTAAATAA

Protein sequence:

>DPOGS205407-PA
MGVFKWLRRERLTVTTGRVSPASSVDDPPVQVGTIRTEEYKRDLAEFQARRSNGSKDVQDTPNKKKCGRHSLPVSCTNCTGDPDATFNYEKKSAMKYKKKSQKRTGSIHSEEKLYEGAPNIHFLFQNQVFMPGNMFASYSEPRKQRKHRQRSPDFLVQKQIDFKDDQPLTPPLFYKDSSLPFDTKNFLELSRSDDELDGVKTGDAKKSSEGSGSHKNDSCDVKLWEVMSELKHFDKWADEQLQAPSTTSKSDDSRSDTSAFGLPIASACNLDVTIERNSSWSLGKWGVVPVQIKRLNDVTVEHVRKKWNMEINILRKFRHPNIILLMGVYPDVQNNVHLICERCIDSLYGMLHMQGRILSIQTSIHYTLDITNALVFLNMQGYLHTALTSKSIMITAQDIAKIADLQPCTRLTKKKIYDKGYDTYKSEPHYTNISDNRTSSESEPLLTQRSDTDVVSKPYDYYLEMTDYNWQAPELFEPDEGMVHPCNKSDVYSLCLILWECCNASVPWKTLNYAKLKELYTLWRTGLQLPRDGSYPGCILAVLEAGLKLDRGHRIDLGGLQVALQKAKHDLDEIEYIILPEKIAKKKSSFSTQPWDTTSPTDSDYQSPKSLEHTAVVHKHTDSSTGSDNEVKSSHNETLRKTNLLTSYTTEKCFVSDEEDNVHTYHSQIKDYSSACSTPIAKLRARIRDIETPGTLHRSDSTEYCSIVSPDTRLTNFGLNDDTNRDEISERSTAKLSRSFTPKSYKPLHIKVPEYNLDSIKTILKDRNGNEKSSYNFDIKNYSLPTTPIARSNKLRKNAWLSGDVDDRRHARNAGGQELSNNNEGK-
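
The transcript and protein records above can be cross-checked with a short translation sketch. The Python below assumes the standard genetic code and that the trailing "-" in the protein record marks the stop codon. The function names translate and expected_cds_length are hypothetical. Only the first 18 and last 12 nucleotides of DPOGS205407-TA are used, copied from the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)

def expected_cds_length(protein_length):
    """Nucleotides needed for protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)

# First 18 nt and last 12 nt of DPOGS205407-TA, as given in the record.
first_codons = "ATGGGTGTTTTCAAGTGG"
last_codons = "GAAGGTAAATAA"

print(translate(first_codons))   # start of the protein record
print(translate(last_codons))    # end of the protein record, including stop
print(expected_cds_length(827))  # compare with the 2484 bp transcript length
```

The 827 aa protein plus one stop codon accounts for 3 x 828 = 2484 nucleotides, which matches the transcript length given in the record, so the transcript as listed appears to consist of coding sequence only.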