Molecular automata1, 2, 3 that combine sensing4, 5, 6, computation7, 8, 9, 10, 11, 12 and actuation13, 14 enable programmable manipulation of biological systems. We use RNA interference (RNAi)15 in human kidney cells to construct a molecular computing core that implements general Boolean logic1, 3, 8, 9, 10, 11, 12, 16 to make decisions based on endogenous molecular inputs. The state of an endogenous input is encoded by the presence or absence of 'mediator' small interfering RNAs (siRNAs). The encoding rules, combined with a specific arrangement of the siRNA targets in a synthetic gene network17, allow direct evaluation of any Boolean expression in standard forms using siRNAs and indirect evaluation using endogenous inputs. We demonstrate direct evaluation of expressions with up to five logic variables. Implementation of the encoding rules through sensory up- and down-regulatory links between the inputs and siRNA mediators will allow arbitrary Boolean decision-making using these inputs.
Introduction
A molecular automaton is an engineered molecular system coupled to a (bio)molecular environment by "flow of incoming messages and the actions of outgoing messages," where the incoming messages are processed by an "intermediate set of elements," that is, a computer18. Molecular automata may implement diverse models of computation (digital and analog circuits, state machines, neural networks) to perform a variety of tasks. We suggest a general-
Molecular logic evaluators have been demonstrated in vitro1, 2, 3, 9, 11, 12 and in live cells8, 10. Up until now, only in vitro systems1, 12, 19 have shown how to evaluate arbitrary logic expressions experimentally, although arbitrary evaluation in vivo using transcription factors has been considered theoretically20, 21. Demonstration that allosteric modulation of small RNAs22, including ribozymes1, 3, 16, riboswitches4, 5 and siRNA6, regulates gene expression prompted us to suggest that, much like transcription factors, small RNA molecules will enable molecular automata to make in vivo evaluations through mediation between endogenous inputs and the downstream molecular 'computing' network.
A logic evaluator operating in an intracellular molecular milieu can serve as a binary decision-making circuit23, that is, trigger one or two discrete processes in response to inputs from this milieu.
The capacity for in vivo decision making based on endogenous inputs could find applications in basic research and medicine, such as in the diagnosis of cancer2, 24. To address this issue, we (i) recast decision-making rules as a logic expression containing intracellular inputs as variables; and (ii) construct a molecular system that produces a molecular output when the expression is evaluated as True for the given input truth values (True when present and False when absent). We propose how to construct such a system for an arbitrary question represented by a logic expression. Although our design suggests separate sensor and evaluator modules, we demonstrate only the evaluator.
There are several theoretically equivalent, but practically different, ways to answer arbitrary logic questions. They generally involve breaking a complex question into a hierarchy of simpler ones. One possibility is to be very stringent with basic modules (e.g., the first input must be True, the second must be False), but connect these modules in a less stringent way where an overall positive result is achieved when any one module gives a positive answer. Another way is to be
To construct an evaluator that embodies the first approach, we build a biological 'circuit' that comprises two or more mRNA species that encode the same protein, but have different noncoding regions. This protein is the system's output; a biologically active output may function as an actuator. If at least one mRNA species is translated, the resulting output will represent a logic True value, implementing an OR operation10, 12 (Fig. 1a). The levels of mRNA species and the output are determined by the presence or absence of the endogenous molecular inputs with the help of molecular mediators. siRNA molecules target untranslated regions and hence are natural candidates for such mediation. First, we fuse different sets of siRNA targets into the 3'-untranslated regions (UTR) of the mRNAs, rendering them susceptible to either of these siRNAs25. Next, we establish selective inhibitory links between endogenous inputs and these siRNAs. All inputs must be present at the same time to block all siRNAs and generate output from an mRNA, corresponding to a logic AND operation (Fig. 1b). Furthermore, if, for example, inputs A and B block siRNAs that target one mRNA and inputs X and Y block siRNAs that target another, the circuit will generate an output when both A and B are present or when both X and Y are present. This comprises the logic expression (A AND B) OR (X AND Y). If an activating link is established instead, the presence or absence of an input will block or enable output production from the mRNA, respectively. In logical terms, this amounts to a negation of the input 'truth value' (Fig. 1c). In the above example, input B activating its mediator siRNA turns the expression into (A AND NOT(B)) OR (X AND Y). The same input may block one siRNA and activate another, and thus appear in the expression both as itself and as its negation. This arrangement of input variables and their negations, known as literals, is called a disjunctive normal form (DNF) (Fig. 1d and Supplementary Fig. 1 online). Literals grouped
Figure 1: Design of the decision-making automaton that uses a DNF evaluator.
A biological circuit that enables the second approach comprises mRNA species that produce a transcription factor that represses an output-encoding gene. If the repressor obtained from one mRNA efficiently downregulates the output, all mRNAs must be removed to generate the output, thus implementing an AND logic operation (Fig. 2a). As before, we fuse sets of siRNA targets into the 3'-UTR
Figure 2: Design of the decision-making automaton that uses a CNF evaluator and automaton's input encoding rules.
(a) A circuit that evaluates an AND operation between mRNA molecules. A downward arrow in table indicates the absence of the mRNA. CAG, chicken
We experimentally implemented DNF and CNF evaluators in immortalized human embryonic kidney cells (293-H). We transfected the cells with the genes comprising the evaluator circuits; we also added, or withheld, mediator siRNA molecules to reflect the anticipated function of the sensory module in accordance with the presence or absence of inputs appearing as variables in expressions (Fig. 2e); and we assayed the output levels after 48 h. We chose derivatives of known siRNAs for the current implementation, and constructed five siRNA-target pairs based on published sequences from nonmammalian genes to represent up to five inputs (T1 and T2 from Renilla reniformis, FF3 and FF4 from firefly luciferases and SI4 from enhanced green fluorescent protein (eGFP); Supplementary Table 1 online). We modified the published sequences by sliding them along their parental genes to afford at least a pair of A/U bases on the 5'-end of the molecule and a pair of C/G bases on the 3'-end to ensure asymmetry in RNA-induced silencing complex assembly26.
Multi-siRNA systems may exhibit undesired crosstalk between individual molecules. We measured this crosstalk, using ZsYellow derivatives with single targets cloned into their 3'-UTR and applying all siRNA molecules at the saturation concentration, one at a time, to each derivative. Crosstalk was negligible for this set of siRNAs (Supplementary Fig. 2 online), except for a possible minor (
Figure 3: Testing individual DNF clause molecules.
(a) Two expressions in DNF form are evaluated for all possible variable assignments as indicated in the figure. 2.5 pmol of each input siRNA (or 2.5 pmol of the negative control siRNA in the case of an absent input siRNA) were cotransfected with 100 ng of each clause molecule and 100 ng of the pAmCyan-C1 transfection control into 293-H cells and assayed after 48 h. The quantitative results corresponding to the images that were obtained using FACS are shown on the right (see Methods). Red pseudocolor represents the transfection control protein AmCyan and the green color represents the output protein ZsYellow. (b) An evaluation of two CNF expressions. In C1 evaluation experiments using LacI, 10 pmol of each siRNA, 50 ng of the CMV-LacI-FF3-FF4 clause molecule, 200 ng of CAGOP-dsRed-monomer reporter and 100 ng of pAmCyan-C1 transfection control were cotransfected into 293-H cells and assayed after 48 h. The expression levels of the reporter obtained by FACS are given relative to the control experiments where active siRNA was replaced with the same level of nonsense siRNA (first row of images). In C1 evaluation experiments using LacI-KRAB, 5 pmol of each siRNA, 5 ng of the CMV-LacI-KRAB-FF3x3-FF4x3 clause molecule, 200 ng of CAGOP-dsRed-monomer reporter and 100 ng of pAmCyan-C1 transfection control were cotransfected into 293-H cells and imaged after 48 h. The expression levels of the reporter given in the figure were obtained by FACS using 100 ng of pZsYellow-C1 transfection control instead of pAmCyan-C1 and they are given relative to the control experiments where active siRNA was replaced with the same level of nonsense siRNA (first row of images). In C2
The constructs and their common sequence motif that includes a stop codon (top) are shown to the left. We cotransfected 10 pmol of each indicated siRNA (columns) with 100 ng of the indicated clause molecule (rows) and 100 ng of the transfection control plasmid pAmCyan-C1 into 293-H cells and assayed after 48 h. The images combine the fluorescent signal from the AmCyan transfection control (red pseudocolor) and the signal from the ZsYellow protein expressed from the clause molecules (green pseudocolor). Low levels of ZsYellow result in red images whereas coexpression of both proteins results in mostly green and yellow spots. Negative control is a nonsense siRNA provided in the same amount as the active siRNAs. The quantitative results that correspond to the images, obtained by FACS measurements and normalized to the negative control for each construct, are shown on the right.