Monarch geneset OGS2.0

DPOGS206602
Transcript        DPOGS206602-TA   2721 bp
Protein           DPOGS206602-PA   906 aa
Genomic position  DPSCF300048 - 1308668-1317641
RNAseq coverage   6415x (Rank: top 2%)
Annotation
Heliconius      HMEL008833        0.0    98.65%
Bombyx          BGIBMGA012171-TA  4e-99  37.90%
Drosophila      Actn-PC           0.0    85.86%
EBI UniRef50    UniRef50_P18091   0.0    91.08%  Alpha-actinin, sarcomeric n=76 Tax=Eumetazoa RepID=ACTN_DROME
NCBI RefSeq     XP_001982582.1    0.0    91.39%  GG12661 [Drosophila erecta]
NCBI nr blastp  gi|270005784      0.0    91.51%  hypothetical protein TcasGA2_TC007894 [Tribolium castaneum]
NCBI nr blastx  gi|270005784      0.0    91.51%  hypothetical protein TcasGA2_TC007894 [Tribolium castaneum]
Group
Gene Ontology    GO:0005515  1.4e-80  protein binding
                 GO:0005509  4.2e-35  calcium ion binding
KEGG pathway     der:Dere_GG12661  0.0
                 K05699 (ACTN) maps-> Amoebiasis
                                      Regulation of actin cytoskeleton
                                      Tight junction
                                      Adherens junction
                                      Arrhythmogenic right ventricular cardiomyopathy (ARVC)
                                      Leukocyte transendothelial migration
                                      Systemic lupus erythematosus
                                      Focal adhesion
InterPro domain  [1-240]    IPR001715  1.4e-80  Calponin homology domain
                 [833-906]  IPR011992  4.2e-35  EF-hand-like domain
                 [836-902]  IPR014837  6.6e-27  EF-hand, Ca insensitive
                 [409-512]  IPR002017  1.9e-24  Spectrin repeat
                 [411-512]  IPR018159  3e-21    Spectrin/alpha-actinin
Orthology group  MCL10565  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206602-TA
ATGAGTGTTCGCAAATTTAGAAATAGGGTTGCACCCTCTTTACAACAAAACTCCCCCATCAGAAGCGCAACCTTCACAGCATGGTGCAACAGCCATTTGAGGAAGGCGGGAACTGGCATCGAAAACATCGAAGAAGATTTCCGCAATGGCTTGAAACTGATGCTGTTGCTTGAAGTCATTTCAGGAGAGACCCTACCCAAGCCTGACCGCGGCAAAATGCGTTTCCACAAGATCGCTAATGTCAACAAGGCTCTAGACTTCATTGCCTCAAAAGGTGTGAAGTTGGTCTCCATCGGTGCTGAGGAAATTGTGGATGGCAACCTTAAGATGACCCTTGGTATGATTTGGACCATCATCTTGAGATTCGCAATTCAAGACATCTCAGTCGAAGAAATGACCGCAAAGGAAGGTCTACTTCTGTGGTGTCAACGAAAGACAGCGCCTTACAAGAATGTAAACGTGCAGAACTTCCATCTGTCCTTCAAGGACGGTCTCGCGTTCTGTGCTCTGATTCACAGGCACAGACCAGATCTGATCGACTACAGCAAGCTGTCCAAAGACAATCCCTTGGAGAACTTGAACACCGCCTTCGATGTCGCGGAGAAATATCTAGACATCCCTCGCATGTTGGACCCCGACGACCTTATAAACACGCCCAAGCCCGACGAACGCGCCATTATGACCTATGTGTCATGTTACTACCACGCGTTCCAAGGCGCGCAACAGAACACGGCCATGCCGGACGAGCGAGCCGTCATGACCTACGTCTCGTCTTACTACCATTGCTTCTCTGGCGCTCAGAAGGCTGAAACTGCCGCAAATCGCATCTGCAAAGTTCTCAAAGTCAACCAAGAGAACGAACGCCTCATGGAAGAGTACGAACGTCTGGCTAGTGATCTACTGGACTGGATCCGACGCACTATGCCTTGGCTGAACAGCCGCCAAACCGACAACTCCCTCGCCGGCTGTCAGAAGAAACTGGAGGATTACCGTACTTACAGGCGTAAGCACAAGCCACCACGTGTAGAACAGAAAGCCAAGTTGGAGACTAACTTCAACACGCTGCAGACGAAACTACGTCTCAGCAATCGCCCCGCCTACATGCCCACCGAGGGCAAAATGGTTTCTGACATCGCCCAAGCTTGGAAGGGTCTCGAGATAGCGGAGAAGGCCTTCGAAGAATGGCTGCTCTCCGAGATGATGAGACTCGAGAGGCTCGAGTATCTGGCTCAGAAGTTCAAGCACAAGGCTGACATACACGAGGACTGGACAAGAGGCAAGGAAGAGATGCTTCAGTCCCAGGACTTCAGGCAGTGCAAGTTGTATGACATCAAAGCGCTGAAGAAGAAGCACGAGGCCTTCGAGAGCGATCTGGCGGCCCACCAGGACCGCGTCGAACAGATTGCAGCCATCGCTCAGGAGTTGAACACTCTGGAGTACCACGAGGTTGGCGCGGTGAACGCTCGCTGCCAGCGCATCTGCTCCCAATGGGACCGCCTCGGAGCCCTGACTCAGCGCCGTCGCGCCGCCCTGGACGACGCCGAACGCCTGCTCGAACAAATCGACCTCCTGCATCTCGAGTTCGCAAAGAGAGCCGCGCCCTTCAACAACTGGTTGGACGGAACTCGCGAGGACTTGGTCGACATGTTCATCGTGCACACCATCGAAGAGATCTCCGGTTTGATGGACGCCCACGCGCGGTTCAAGGCGACCCTCGGAGAAGCGGACAAGGAGTACCAGGCCATCGTCAATCTCGTCCACCAGGTGGAGTCCATCGTCAAGCAACACCAGATACCTGGAGGATTGGAGAACCCATACACCACGCTCACCGCTCATGAGTTGAACCGCAAGTGGTCCGACGTGCGGCAGTTGGTGCCGCAGCGGGACAGCACTCTAGCGGCAGAGCTCCGTAAGCAGCAGAACAACGAGACGCTCAGGAGACAGTTCGCGGAGAAAGCCAACGCTGTTGGACCCTGGATCGAGAGGCAGATGGACGCCGT
CACCGCCATCGGCATGGGACTACAGGGCTCTTTGGAAGACCAATTACATCGGCTGAAGGAGTACGAGGCGGGAGTGTACGCTTACAAACCGCACATCGAAGAACTCGAGCGCATCCATCAAGCCGTGCAAGAGGGCATGATATTTGAGAACAGATATTCACAATACACTATGGAGACTCTTCGTGTGGGCTGGGAGCAGCTCTTGACTTCCATCAACCGCACCATCAATGAAGTGGAGAACCAGATCCTCACCCGCGACTCGAAGGGCATCACCCAGGAACAACTCACTGAATTCCGCGCATCATTCAACCACTTCGACAAAAACCGCACTGGTCGTCTGGCTCCTGAAGAACTCAAGTCTTGCCTCGTGTCACTCGGATACAGCATCGGTAAAGACAGGCAAGGGGAACTGGACTTCCAACGCATTCTTGCTGTCGTCGACCCTAACAACACTGGCTACGTCTCGTTCGATGCCTTCTTGGACTTCATGACCAGAGAATCAACTGACACCGACACTGCCGAACAGGTTATTGACAGCTTCAGGATCTTAGCCGGTGACAAGCCGTATATAACCGCGGACGAGTTGCGTCGTGAGCTGCCTCCCGACCAGGCGGAGTACTGCGTGGCTCGCATGCCGCCATACCGTGGACCGAACGCTCCGCCACACGCACTCGACTACATGGCCTTCTCCACCGCACTCTACGGGGAGACTGACCTCTAA

Protein sequence:

>DPOGS206602-PA
MSVRKFRNRVAPSLQQNSPIRSATFTAWCNSHLRKAGTGIENIEEDFRNGLKLMLLLEVISGETLPKPDRGKMRFHKIANVNKALDFIASKGVKLVSIGAEEIVDGNLKMTLGMIWTIILRFAIQDISVEEMTAKEGLLLWCQRKTAPYKNVNVQNFHLSFKDGLAFCALIHRHRPDLIDYSKLSKDNPLENLNTAFDVAEKYLDIPRMLDPDDLINTPKPDERAIMTYVSCYYHAFQGAQQNTAMPDERAVMTYVSSYYHCFSGAQKAETAANRICKVLKVNQENERLMEEYERLASDLLDWIRRTMPWLNSRQTDNSLAGCQKKLEDYRTYRRKHKPPRVEQKAKLETNFNTLQTKLRLSNRPAYMPTEGKMVSDIAQAWKGLEIAEKAFEEWLLSEMMRLERLEYLAQKFKHKADIHEDWTRGKEEMLQSQDFRQCKLYDIKALKKKHEAFESDLAAHQDRVEQIAAIAQELNTLEYHEVGAVNARCQRICSQWDRLGALTQRRRAALDDAERLLEQIDLLHLEFAKRAAPFNNWLDGTREDLVDMFIVHTIEEISGLMDAHARFKATLGEADKEYQAIVNLVHQVESIVKQHQIPGGLENPYTTLTAHELNRKWSDVRQLVPQRDSTLAAELRKQQNNETLRRQFAEKANAVGPWIERQMDAVTAIGMGLQGSLEDQLHRLKEYEAGVYAYKPHIEELERIHQAVQEGMIFENRYSQYTMETLRVGWEQLLTSINRTINEVENQILTRDSKGITQEQLTEFRASFNHFDKNRTGRLAPEELKSCLVSLGYSIGKDRQGELDFQRILAVVDPNNTGYVSFDAFLDFMTRESTDTDTAEQVIDSFRILAGDKPYITADELRRELPPDQAEYCVARMPPYRGPNAPPHALDYMAFSTALYGETDL-
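
Length check:

The lengths reported in this record can be cross-checked against each other. The transcript starts with ATG and ends with TAA, so if it is assumed to consist of coding sequence only, its length should equal three times the number of residues plus one stop codon. The sketch below is illustrative only; the function name `cds_length_matches` is hypothetical and not part of any MonarchBase tool.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a coding sequence of transcript_bp nucleotides
    can encode protein_aa residues (plus one stop codon if has_stop).

    Assumption: the transcript contains coding sequence only (no UTRs).
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Values taken from this record: 2721 bp transcript, 906 aa protein.
print(cds_length_matches(2721, 906))  # (906 + 1) * 3 == 2721
```

With the values from this record the check passes, which agrees with the trailing "-" (stop) at the end of the protein sequence.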