Monarch geneset OGS2.0

DPOGS200812
TranscriptDPOGS200812-TA1470 bp
ProteinDPOGS200812-PA409 aa
Genomic positionDPSCF300249 + 26874-32604
RNAseq coverage214x (Rank: top 45%)
Annotation
HeliconiusHMEL0119220.078.64% 
BombyxBGIBMGA005386-TA4e-8487.28% 
DrosophilacenB1A-PA9e-11047.55% 
EBI UniRef50UniRef50_UPI00017588546e-14064.42%UPI0001758854 related cluster n=1 Tax=unknown RepID=UPI0001758854
NCBI RefSeqXP_975199.21e-14064.42%PREDICTED: similar to centaurin beta [Tribolium castaneum]
NCBI nr blastpgi|2700140572e-13964.42%hypothetical protein TcasGA2_TC012753 [Tribolium castaneum]
NCBI nr blastxgi|2700140574e-13462.24%hypothetical protein TcasGA2_TC012753 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.3e-31protein binding
GO:00057375.7e-06cytoplasm
KEGG pathwaytca:6640893e-140 
 K12489 (ACAP)maps-> Endocytosis
InterPro domain[275-375] IPR0119931.3e-31Pleckstrin homology-type
[275-372] IPR0018499.1e-18Pleckstrin homology domain
[19-217] IPR0041485.7e-06BAR
Orthology groupMCL11312 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200812-TA
ATGAAGCCTCTAATAGATTTTGATGAATGTCTCAGAGATTCACCCAAATTTAGGGAACAGCTGGAAACCGAAGAGGCTAACATTGAGGGTTTGGAACAAAAGCTTGATAAAGTTCTCAAGACTTGTTCTGTTATGATTGAATCGGGCAAAACATATATGAATCATAGAAGCACATTCACAAATGCTCTGTGGGATCTGAGCAGTAGCTTTTCAGAAGATTCAACTGTGGTGTCAGCATTGCATAGAATGATACATGCTTTGCAAGAAATGACCAAGTTCCATTCAATACTGTTGGATCAAGCCTCAAGAACAATACTTAAGAACCTTACAGCTTTTATTAAAGTTGACATTAAAGGTGTGAAAGAAAGCAAACATCATTTTGATAAAATCTCAAATGAATTGGATATAGTATTAAATAGAAATTCGCAGGTATCTCGTCATAAGTCCACTGATGTGGAAGAAGTCATGAATCTCTTACTTGCCACTCGATCATGTTTCCGGCACACAGCACTCGATCATGTCCAGAAGATTACGATGCTTCAAGTGAGGAAACGTCATGAGATATTGGCTACGTTCCTATCGTATTTGCAAGCCTGTAGCACATACAATCACCAGGGCGCCGATCTGTCCGAAGATTTGGAACCATTTCTTAAATCCACCGCTGACGAGATTGCAACAATGCGTAACGATACTAAGTCGCTGGACAAAGAGATGGAAAATCGTCATACTATAGTTAATAGCAAGGATACAGTGCTACCAAGTTGTATCATGAGCAGTGACGGTGAGAAGGAAGGTACGATCAGCCCGTGTCTGAAGAACATGCCCAGGATGCAGGGGTATCTGTTCAAGCGAACGTCCAACGCCTTCAAGACCTGGAACAGAAGATGGTTCTATTTATACGATAACCGACTCGTCTACAGGAAAAGAACCGGTGAATTGAACGTTACTGTTATGGAAGAGGATTTGCGACTGTGTACCGTCAAACCTGTGTATGACGGGGAGAGGAGGTTCTGTTTTGAAGTACTATCACCATCAAAGAGTCACATGCTTCAGGCGGATTCGGAGGAAATGTTGAACTCGTGGATAGCAGCTCTACAGAACGGTATACGGTCCGCCATACAGCAGGGACAGAGCAGGGACAACCCGGACTCGCAGATGGTGTTGGTACCCGACGACACCAAACAGACCAGCAAACATAACTTGACACCTGCTATGAAGAAAATAAGGTAATGAGATCTAAGGTTTTTTTATCGATTATCCATTTAAACGTAACTAATATTTTCGGATTATTACGCGTATTTTACTGTGACATCATTACATCTCGTCTGCCGTGATAGCGGTTCACGGTTACGTTATTCGCAACAGACTAAACTTTTATTTTACTCTTGTAAGGAAACCACAGATAAAAACAAAACACATTATGTTGACACAAATATTGTCATAGACCAACACACAGTTTCTACAAAATGA

Protein sequence:

>DPOGS200812-PA
MKPLIDFDECLRDSPKFREQLETEEANIEGLEQKLDKVLKTCSVMIESGKTYMNHRSTFTNALWDLSSSFSEDSTVVSALHRMIHALQEMTKFHSILLDQASRTILKNLTAFIKVDIKGVKESKHHFDKISNELDIVLNRNSQVSRHKSTDVEEVMNLLLATRSCFRHTALDHVQKITMLQVRKRHEILATFLSYLQACSTYNHQGADLSEDLEPFLKSTADEIATMRNDTKSLDKEMENRHTIVNSKDTVLPSCIMSSDGEKEGTISPCLKNMPRMQGYLFKRTSNAFKTWNRRWFYLYDNRLVYRKRTGELNVTVMEEDLRLCTVKPVYDGERRFCFEVLSPSKSHMLQADSEEMLNSWIAALQNGIRSAIQQGQSRDNPDSQMVLVPDDTKQTSKHNLTPAMKKIR-