On Conditional Axioms and Associated Inference Rules

Borrego-Díaz, Joaquín; Cordón-Franco, Andrés; Lara-Martín, Francisco Félix

doi:10.3390/axioms13050306

Open AccessArticle

On Conditional Axioms and Associated Inference Rules

by

Joaquín Borrego-Díaz

^*

,

Andrés Cordón-Franco

and

Francisco Félix Lara-Martín

Departamento de Ciencias de la Computación e Inteligencia Artificial, Universidad de Sevilla, E.T.S. Ingeniería Informática, Avda. Reina Mercedes s.n., 41012 Sevilla, Spain

^*

Author to whom correspondence should be addressed.

Axioms 2024, 13(5), 306; https://doi.org/10.3390/axioms13050306

Submission received: 14 March 2024 / Revised: 23 April 2024 / Accepted: 2 May 2024 / Published: 7 May 2024

(This article belongs to the Topic Mathematical Modeling)

Download Versions Notes

Abstract

:

In the present paper, we address the following general question in the framework of classical first-order logic. Assume that a certain mathematical principle can be formalized in a first-order language by a set E of conditional formulas of the form

α (v) \to β (v)

. Given a base theory T, we can use the set of conditional formulas E to extend the base theory in two natural ways. Either we add to T each formula in E as a new axiom (thus obtaining a theory denoted by

T + E

) or we extend T by using the formulas in E as instances of an inference rule (thus obtaining a theory denoted by

T + E – Rule

). The theory

T + E

will be stronger than

T + E – Rule

, but how much stronger can

T + E

be? More specifically, is

T + E

conservative over

T + E – Rule

for theorems of some fixed syntactical complexity

Γ

? Under very general assumptions on the set of conditional formulas E, we obtain two main conservation results in this regard. Firstly, if the formulas in E have low syntactical complexity with respect to some prescribed class of formulas

Π

and in the applications of

E – Rule

side formulas from the class

Π

and can be eliminated (in a certain precise sense), then

T + E

is

\forall B (Π)

-conservative over

T + E – Rule

. Secondly, if, in addition, E is a finite set with m conditional sentences, then nested applications of

E – Rule

of a depth at most of m suffice to obtain

\forall B (Π)

conservativity. These conservation results between axioms and inference rules extend well-known conservation theorems for fragments of first-order arithmetics to a general, purely logical framework.

Keywords:

first-order inference rules; conservative extensions; existentially closed models

MSC:

03B10; 03F03; 03C07

1. Introduction

In the field of logic, encompassing both mathematical and computational aspects, the question of whether to prioritize axioms or rules greatly influences the study of various problems. This dilemma affects areas like logic programming design, different axiomatizations in arithmetics, and automated reasoning, among others.

Designing rules that enhance demonstration by substituting axioms is a viable approach to addressing provability in a theory (e.g., see [1]), provided that the impact of such substitution on the theorems of the latter is analyzed. For example, it is essential to manage induction in equational reasoning [2] or noetherian induction by rewriting [3], among others. Specific representation formalisms, such as geometric logic, can also benefit from the introduction of rules to simplify the axiom set [4,5].

In some applied fields, rule-based reasoning plays a pivotal role in various realms of mathematical or computational logic, the Semantic Web, and expert systems, among others, where rules provide a flexible and potent framework for representing and applying knowledge in a computer system.

Studying the (proof–theoretic) strength of a deductive system when we replace axioms for rules is not only interesting theoretically but also useful for tasks like designing and implementing computational proof systems.

This paper presents some theoretical tools and results that can be useful in enhancing our understanding of the (proof–theoretic) strength of systems representing knowledge by means of conditional axioms or, instead, by means of rules.

1.1. From Conditional Axioms to Rules

In this work, we address the following general question. Assume that a certain mathematical principle can be formalized in a first-order language by a set E of conditional formulas of the form

α (v) \to β (v)

(for instance, an induction principle). Then, given a first-order theory T expressing some basic properties of the functions and relations involved, T can be extended by adding the (universal closures of the) formulas in E as new axioms. In this way, we obtain a new theory that we denote by

T + E

. However, it is also possible (and in some cases perhaps more convenient, see some examples below) to extend T by using the involved principle to produce a new inference rule. This can be carried out, for example, by considering for each

α (v) \to β (v)

in E an instance

\frac{\forall v α (v)}{\forall v β (v)}

of a new inference rule denoted by

E – Rule

. Let

T + E – Rule

be the closure of T under first-order logic and applications of this new rule. This approach can also be seen as a weaker version of the typical conversion process from a Hilbert-style axiomatic system to a (Gentzen) sequent system. After leaving out the external quantifiers, the conversion primarily involves identifying a sound decomposition for each axiom to generate the rules.

In general,

T + E

will be a proper extension of

T + E – Rule

, but how much stronger than

T + E – Rule

can

T + E

be? What is the exact relationship between both theories? And, are there other natural rules associated with a set E of conditional axioms that provide interesting information about

T + E

?

Before delving into the answers to these questions, let us present the scenario from which we extract both motivation and methods to deal with the previous problems: the study of conservation results between formal arithmetic theories. By a conservation result between two theories

T_{1}

and

T_{2}

, we mean a proposition stating that for some class of sentences

Γ

and every sentence

θ \in Γ

, if

T_{1}

proves

θ

, then so does

T_{2}

(and in such a case, we say that

T_{1}

is

Γ

-conservative over

T_{2}

).

Classical axiom schemes axiomatizing Peano arithmetics (say,

Σ_{n}

–induction,

Σ_{n}

–collection, and uniform

Σ_{n}

–reflection,

n \geq 1

) share basic axiomatization and conservativity properties. It is a well-known fact that it is possible to develop a uniform treatment of conservation results for these schemes and, as a matter of fact, several (mainly proof–theoretic) methods providing uniform derivations of the basic conservation results are known. These methods include Herbrand Analysis as developed by W. Sieg in [6,7], S. Buss’ witnessing method both in Peano arithmetic and in bounded arithmetic (see [8,9]), the model–theoretic approach to Herbrand Analysis developed by J. Avigad in [10], and several approaches based on applications of cut elimination, see for instance Parsons’ [11], or more recently, the approach followed by L. Beklemishev in [12,13] (also closely related to Sieg’s method).

However, the basic ideas of these methods are, to a great extent, independent of any specific arithmetic notion and, therefore, could be developed for arbitrary (countable) first-order languages and theories. In order to explore this intuition, we can isolate the main characteristics of these arithmetical contexts.

Most axiom schemes that are considered in first-order arithmetic produce (when restricted to some class of formulas) sets of conditional axioms; that is, formulas of the form

α (\vec{v}) \to β (\vec{v}),

where

\vec{v}

denotes a sequence

v_{1}, \dots, v_{n}

of possible free variables. A classical example from formal arithmetics is the

Σ_{1}

–induction scheme,

I Σ_{1}

, given by the formulas

φ (0, \vec{v}) \land \forall x (φ (x, \vec{v}) \to φ (x + 1, \vec{v})) \to \forall x φ (x, \vec{v})

where

φ (x, \vec{v})

varies within the class of formulas

Σ_{1}

in the arithmetic hierarchy. We can also introduce an inference rule,

E – Rule

, associated with E in a natural way. Namely, E–Rule denotes the inference rule whose instances are

\frac{\forall \vec{v} α (\vec{v})}{\forall \vec{v} β (\vec{v})}

for each

α (\vec{v}) \to β (\vec{v}) \in E

. The rule corresponding to the previous

Σ_{1}

–induction scheme is denoted by

Σ_{1} – IR

and consists of instances of the form

\frac{\forall \vec{v} (φ (0, \vec{v}) \land \forall x (φ (x, \vec{v}) \to φ (x + 1, \vec{v})))}{\forall \vec{v} \forall x φ (x, \vec{v})}

where

φ (x, \vec{v}) \in Σ_{1}

.

Then, given a base theory T, we are interested in conservation properties of the extensions of T,

T + E

, and

T + E – Rule

(the closure of T under first-order logic and applications of

E – Rule

.) The analysis of these conservation properties typically proceeds from some level of syntactical complexity (represented for some class of formulas

Π

) and (for some class of formulas

Γ

containing

Π

) studies of

Γ

–conservation results between the theories associated with E over the base theory T.

It is a classical theorem in the proof theory of first-order arithmetics (proved by Parsons [11]) that (over a base theory,

T_{0}

, axiomatized by some elementary arithmetic facts) the scheme

I Σ_{1}

is

Π_{2}

-conservative over

T_{0} + Σ_{1} – IR

. More precisely,

T_{0} + I Σ_{1}

is a

Π_{2}

-conservative extension of

T_{0} + Σ_{1} – IR

. In this work, we isolate conditions under which a similar result to Parsons’ theorem can be derived for a given set of conditional axioms E. We show that if the formulas in the set of conditional axioms E have a suitable syntactical complexity, then the conservation properties of the theories associated with E can be derived from simple conditions on the

E – Rule

. In order to derive these general counterparts of Parson’s theorem, the following auxiliary notions are introduced.

If, for each

α (\vec{v}) \to β (\vec{v}) \in E

, the formulas

α (\vec{v})

and

β (\vec{v})

have a low syntactical complexity with respect to the basic level

Π

(see Definition 6), then we say that E is a set of normal conditional axioms with respect to

Π

. Let us denote by

\forall B (Π)

the set of universal closures of boolean combinations of formulas in

Π

. Then, the set of

\forall B (Π)

consequences of

T + E

can be described using an auxiliary inference rule

E^{Π} – Rule

, with instances

\frac{\forall \vec{v} \forall \vec{z} (θ (\vec{v}, \vec{z}) \to α (\vec{v}))}{\forall \vec{v} \forall \vec{z} (θ (\vec{v}, \vec{z}) \to β (\vec{v}))}

where

α (\vec{v}) \to β (\vec{v}) \in E

and

θ (\vec{v}, \vec{z})

is a conjunction of formulas that belong to

Π

or are negations of some formulas in

Π

. Using a model–theoretic argument (based upon simple properties of a general notion of an existentially closed model, following ideas developed by Avigad in [10]) we prove that for every theory T (axiomatized by formulas with restricted syntactical complexity, see Corollary 1),

For every set of normal conditional axioms with respect to $Π$ and E, $T + E$ is $\forall B (Π)$ -conservative over $T + E^{Π} – Rule$ (the closure of T under first-order logic and applications of $E^{Π} – Rule$ ).

Let us remark here that the auxiliary

E^{Π} – Rule

is a natural device in the proof–theoretic approach to conservation results for arithmetical theories via cut elimination. As a matter of fact, the rule

E^{Π} – Rule

was considered by J. C. Shepherdson in his analysis of open induction (see lemma 2.3 in [14]). A very similar rule was also used for

Σ_{n}

induction by Parsons (see section §3 in [11]) and more recently by Beklemishev in his work on

Σ_{n}

–collection (see [12]) and

Δ_{n + 1}

–induction (see [13]). Our proof of Corollary 1 (as a consequence of Proposition 2) provides a model–theoretic interpretation of these proof–theoretic arguments.

Corollary 1 suggests the following notion (see Definition 7): we say that E is weakly Π-reducible modulo T if

T^{'} + E^{Π} – Rule

and

T^{'} + E – Rule

are equivalent for every theory

T^{'}

extending T. Putting it all together, we can state a version of Parson’s theorem in a more general context where the axiomatization of T has again a restricted syntactical complexity (see Theorem 1 for this and other related conservation results):

Let E be a set of normal conditional axioms with respect to $Π$ . If E is weakly $Π$ -reducible modulo T, then $T + E$ is $\forall B (Π)$ -conservative over $T + E – Rule$ .

This result, and more generally Theorem 1, can be considered as a reformulation of some aspects of Kaye’s model–theoretic work in [15] where, using Henkin constructions, Kaye derived conservation results in the spirit of Theorem 1. As pointed out in the survey paper [16], a uniform model–theoretic treatment of conservation results between several induction schemes and their parameter-free versions can be obtained using these ideas. The model–theoretic core of this uniform approach was developed in a general setting in [17]. The main conceptual difference between Kaye’s approach and our exposition in this paper is, on top of the use of existentially closed models, the emphasis on the role played by inference rules in these results that, we think, allows for a more systematic presentation.

In a similar vein, in Section 5, we reinterpret, using the notion of a set of normal conditional sentences, a conservation result obtained by Kaye (see Theorem 1.4 in [17]). Namely, we show that (see Theorem 3), given a theory T and a finite set E of normal conditional sentences with respect to

Π

, then

If a $\forall B (Π)$ sentence $θ$ can be derived from T together with m sentences in E, then $θ$ can be also derived from $T + E^{Π} – Rule$ using nested applications of $E^{Π} – Rule$ with a depth of at most m.

For the reader’s convenience, Table 1 summarizes the axiomatizations of the theories we will work with in this paper. Given a first-order language L, let E denote a set of conditional axioms of the form

α (\vec{v}) \to β (\vec{v})

and let

Π

denote a basic fragment of L (as defined in Definition 3). The first two theories are given by a base theory T together with a set of axioms. The remaining theories are constructed using (Hilbert-style) inference rules and thus are defined by the closure of the base theory T under applications of the corresponding rule.

Table 2 summarizes the main conservation results obtained in this paper. For a given base theory T, the theory

T + E

is conservative over each of the following subtheories (it is important to note that, for the first three conservation results, we also require the base theory T to be

\forall \exists B (Π)

-axiomatizable, whereas the remaining four hold for base theories T of arbitrary syntactical complexity).

1.2. Aim and Structure of the Paper

In this paper, we prove some conservation results between, on the one hand, an arbitrary theory axiomatized (over a basic theory T) by a set of conditional axioms E and, on the other hand, theories axiomatized (again over T) by (nested) applications of the rules

E – Rule

and

E^{Π} – Rule

. Although the results included here are not essentially new and the corresponding arithmetical versions are well-known, we think that the model–theoretic approach that we develop in their proofs is rather simple and clear. This simplicity aids in making the whole presentation of these kinds of results in a general context very easy to follow. In a fundamental sense, this paper revisits a substantial part of the work of Kaye in [16,17] through the light of the methods introduced more recently by Avigad in [10]. It also reformulates Kaye’s results in terms of natural inference rules, as can be found in more standard proof–theoretic works [12,18]. We hope this reformulation, together with the model–theoretic approach we adopt here, can contribute to making the logical content of these results more accessible to a wider audience, including researchers working on topics with no direct connection with formal arithmetics.

The structure of this paper is as follows. The next section, Section 2, specifies the basic notions and notation used in the paper. In Section 3, we present the basic model–theoretic device used throughout the paper to derive conservation results. Here, a generalization of the notion of an existentially closed model plays a central role. The notion of a normal conditional axiom is introduced in Section 4, where some conservation results between the theory axiomatized by conditional axioms and the one obtained by considering the associated rules are established. The specific case of a finite set of conditional sentences is analyzed in Section 5. This paper concludes with some considerations about the results obtained and possible lines of future research.

2. Inference Rules and Conditional Axioms

We always work in classical first-order logic with equality. Let us fix a countable first-order language L. A formula is a literal of L if it is atomic or negated atomic.

As usual, a theory T is a set of sentences of L closed under logical consequence (that is, for each formula

φ

, if

T ⊧ φ

, then

φ \in T

). An axiomatization of T is a set of formulas

Γ

such that

T = {φ : Γ ⊧ φ}

.

Through this paper, we shall deal with different sets of formulas built up from a basic distinguished set using several syntactical operations. So, we shall begin by defining these operations. Given a set of formulas

Γ

, the following notation will be used:

$\neg Γ$ is the set of formulas ${\neg φ (\vec{x}) : φ (\vec{x}) \in Γ}$ .
$\exists Γ$ is the set ${\exists \vec{x} φ (\vec{x}) : φ (\vec{x}) \in Γ}$ (the sets $\forall Γ$ , $\exists \forall Γ$ , $\forall \exists Γ$ ,…are defined accordingly using the appropriate blocks of initial quantifiers).
$Γ^{\land}$ (resp. $Γ^{\lor}$ ) is the set of all finite conjunctions (resp. disjunctions) of formulas of $Γ$ . $Γ^{+}$ is the closure of $Γ$ under disjunctions and conjunctions.
$B (Γ)$ denotes the set of boolean combinations of formulas in $Γ$ .

As usual, a tuple of elements (or variables)

a_{1}, \dots, a_{n}

is abbreviated by

\vec{a}

, and we write

φ (\vec{x})

to indicate that the free variables of

φ

are among the tuple

\vec{x}

.

For a given base theory T, extensions of T can typically be axiomatized in two ways: (1) by adding a new set of sentences E to T and closing under logical consequence, or (2) by closing T under (first-order logic and) applications of some new inference rules. In this paper, we shall explore the relationship between both extensions.

Firstly, we recall some basic notions and terminology on inference rules introduced by Beklemishev in [18]. By an inference rule, we mean a set of instances, that is, expressions of the form

\frac{φ_{1}, \dots, φ_{n}}{ψ}

where

φ_{1}, \dots, φ_{n}, ψ

are formulas. If R is an inference rule, then

[T, R]

denotes the closure of T under first-order logic and unnested applications of R (that is, a proof in

[T, R]

may contain several applications of R but they are not to occur on the same branch within the proof). By recursion on

k \in ω

, we define

{[T, R]}_{0} = T

and

{[T, R]}_{k + 1} = [{[T, R]}_{k}, R]

. The closure of T under first-order logic and applications of R is

T + R = ⋃_{k \in ω} {[T, R]}_{k}

.

Definition 1.

Let

R_{1}

and

R_{2}

be rules and let U be a theory.

1.: The rule $R_{1}$ is derivable from $R_{2}$ modulo U if for every extension T of U, $T + R_{2}$ extends $T + R_{1}$ .
2.: The rules $R_{1}$ and $R_{2}$ are equivalent modulo U if for every extension T of U, $T + R_{1} \equiv T + R_{2}$ (that is, they are equivalent theories).
3.: The rule $R_{1}$ is reducible to $R_{2}$ modulo U if for every extension T of U, $[T, R_{2}]$ extends $[T, R_{1}]$ .
4.: $R_{1}$ and $R_{2}$ are congruent modulo U if for every extension T of U, $[T, R_{1}] \equiv [T, R_{2}]$ .

Many significant mathematical and combinatorial principles are expressed in the form of implications, where the satisfaction of a certain condition A entails the presence of another condition B. The formal representation of these principles corresponds to formulas whose outermost connective is the logical implication symbol →. This motivates the following definition.

Definition 2.

We say that a set E of L-formulas is a set of conditional axioms if every element of E is a formula of the form

α (\vec{v}) \to β (\vec{v})

(recall that for a formula φ, we write

φ (\vec{v})

to mean that the free variables of φ are among the variables

\vec{v}

).

Let T be an L-theory and let E be a set of conditional axioms. Then, by definition,

T + E

is the theory axiomatized by T plus the universal closure of every formula in E, i.e., the theory given by T plus

\forall \vec{v} (α (\vec{v}) \to β (\vec{v})),

for each

α (\vec{v}) \to β (\vec{v}) \in E

.

A natural inference rule,

E – Rule

, can be associated with E by considering the instances

\frac{\forall \vec{v} α (\vec{v})}{\forall \vec{v} β (\vec{v})}

for each

α (\vec{v}) \to β (\vec{v}) \in E

.

In this paper, we explore the relationship between the theories

T + E

,

T + E – Rule

, and

{[T, E – Rule]}_{k}

(k \geq 1)

for a given set of conditional axioms.

In order to be able to determine more precisely the relative proof–theoretic strength of these theories, we shall fix some level of syntactical complexity for the formulas considered. This will be performed through the notion of a basic fragment of a first-order language:

Definition 3.

A set of formulas of L, Π, is a basic fragment of L if Π satisfies the following conditions:

1.: Every atomic formula of L belongs to Π.
2.: If $φ \in Π$ and θ is a subformula of φ, then $θ \in Π$ .
3.: If $φ (x, v_{1}, \dots, v_{n}) \in Π$ and t is a term of L, then $φ (t, v_{1}, \dots, v_{n}) \in Π$ .

In short, a basic fragment is a set of formulas that includes all atomic formulas and is closed under subformulas and term substitution.

Let us enumerate a few natural examples of basic fragments.

The sets $\forall_{n}$ and $\exists_{n}$ of formulas of L (where $\forall_{0} = \exists_{0}$ denotes the class of open formulas of L and for each $n \geq 0$ , $\exists_{n + 1} = \exists \forall_{n}$ and $\forall_{n + 1} = \forall \exists_{n}$ ).
The set of all literals of L (a formula is a literal if it is atomic or negated atomic).
The set comprising all clauses (i.e., disjunctions of literals) of L.
In the context of arithmetic languages, other examples are the classes in the arithmetical hierarchy $Π_{n}$ and $Σ_{n}$ , $n \geq 0$ , (see [19] or [20]); the sets $U_{n}$ and $E_{n}$ , $n \geq 0$ , in the $Δ_{0}$ hierarchy of bounded formulas (see [21]); and the sets ${\hat{Π}}_{n}^{b}$ and ${\hat{Σ}}_{n}^{b}$ , $n \geq 0$ , of strictly bounded formulas considered in bounded arithmetic (see [20]).

Given a basic fragment

Π

, we associate E with a new set of conditional axioms denoted by

E^{Π}

and given by the set of all formulas

(θ_{1} (\vec{v}, \vec{z}) \land \dots \land θ_{k} (\vec{v}, \vec{z}) \to α (\vec{v})) \to (θ_{1} (\vec{v}, \vec{z}) \land \dots \land θ_{k} (\vec{v}, \vec{z}) \to β (\vec{v}))

where

α (\vec{v}) \to β (\vec{v}) \in E

, and for each

j = 1, \dots, k

,

θ_{j} (\vec{v}, \vec{z}) \in Π \cup \neg Π

. We will also consider its associated inference rule,

E^{Π} – Rule

, given by the set of instances

\frac{\forall \vec{v} \forall \vec{z} (θ_{1} (\vec{v}, \vec{z}) \land \dots \land θ_{k} (\vec{v}, \vec{z}) \to α (\vec{v}))}{\forall \vec{v} \forall \vec{z} (θ_{1} (\vec{v}, \vec{z}) \land \dots \land θ_{k} (\vec{v}, \vec{z}) \to β (\vec{v}))}

where

α (\vec{v}) \to β (\vec{v}) \in E

and for each

j = 1, \dots, k

,

θ_{j} (\vec{v}, \vec{z}) \in Π \cup \neg Π

.

Let us observe that

T + E

is equivalent to

T + E^{Π}

(both are different axiomatizations of the same theory) but, in general, the theories

[T, E – Rule]

and

[T, E^{Π} – Rule]

are not equivalent. Indeed,

[T, E^{Π} – Rule]

always extends

[T, E – Rule]

(to simulate an application of

E – Rule

, one applies the corresponding instance of

E^{Π} – Rule

with

θ (z_{1}) \equiv (z_{1} = z_{1})

, which belongs to

Π

for any basic fragment). However, in general,

[T, E – Rule]

may not necessarily be an extension of

[T, E^{Π} – Rule]

.

3. A Model–Theoretic Standpoint

In this section, we introduce the machinery that we will use to derive our conservation results between theories

T + E

and

T + E – Rule

, where E is a set of conditional axioms.

The methods we use in this paper are model–theoretic in nature and essentially follow the methodology introduced by Avigad in [10] who, in turn, mentions that in the context of bounded arithmetic, this methodology has been used in Zambella [22], where it is attributed to unpublished work by Albert Visser.

Central to our approach is the notion of an

\exists Π

-closed model of a theory T, where

Π

is a fixed but arbitrary basic fragment. This notion is a natural generalization of the well-known concept of an existentially closed model and extends the concept of a Herbrand-saturated model introduced in [10].

All the languages and models considered through this paper are countable. Given two L structures

A

and

B

and a set of formulas

Φ

, we shall write

A ≺_{Φ} B

to express that

B

is a

Φ

-elementary extension of

A

; that is,

A

is a substructure of

B

, and for each

θ (u_{1}, \dots, u_{n}) \in Φ

and each

a_{1}, \dots, a_{n} \in A

, we have

A ⊧ θ (\vec{a}) ⟺ B ⊧ θ (\vec{a}) .

Definition 4.

Given a basic fragment Π of a first-order language L and an L structure

A

, the Π-diagram of

A

is the set of sentences of the language

L \cup {a : a \in A}

given by

D_{Π} (A) = {φ (\vec{a}) : A ⊧ φ (\vec{a}) a n d φ (\vec{x}) \in Π \cup \neg Π}

Remark 1.

Let Π be a basic fragment of a language L and let

A

be an L structure.

1.

If

B

is an L structure and

A

is a substructure of

B

, then

B ⊧ D_{Π} (A) ⟺ A ≺_{Π} B .

2.

For every L-theory T, the following conditions are equivalent:

(a): $T + D_{Π} (A)$ is consistent.
(b): There exists $B ⊧ T$ such that $A ≺_{Π} B$ .

Remark 2.

Let us observe that, for every two L structures

A

and

B

, we have

A ≺_{Π} B ⟺ A ≺_{B (Π)} B .

The next definition introduces a straightforward generalization of the notion of an existentially closed model.

Definition 5.

Let

A

be an L structure. We say that

A

is an

\exists Π

-closed model of T if

A ⊧ T

, and for each

B ⊧ T

,

A ≺_{Π} B

implies

A ≺_{\exists Π} B .

Observe that taking

Π = \forall_{0}

, we obtain the classical notion of an existentially closed model from Model Theory; taking

Π = \forall_{1}

, we obtain the notion of a Herbrand-saturated model from [10]; taking

Π = Δ_{0}

, we obtain the notion of a 1-closed model from [23]; and taking

Π = {\hat{Π}}_{i}^{b}

, we obtain the notion of an

\exists {\hat{Π}}_{i}^{b}

-closed model from [24].

The usual chain argument for constructing existentially closed models provides us with an existence lemma. The proof of Lemma 1 is rather standard but we include a sketch so that the reader can check where the properties defining the notion of a basic fragment

Π

are needed.

Lemma 1.

Let T be a

\forall \exists B (Π)

-axiomatizable consistent theory. Then, for each

A ⊧ T

, there exists an

\exists Π

-closed model of T,

B

, such that

A ≺_{Π} B

.

Proof.

Let

A

be a model of T. In the first step, we will construct a model of T,

A^{1}

, satisfying

A ≺_{Π} A^{1}

, and for each

C ⊧ T

with

A^{1} ≺_{Π} C

and for all

φ (x_{1}, \dots, x_{n}, \vec{y}) \in Π

and

a_{1}, \dots, a_{n} \in A

,

C ⊧ \exists \vec{y} φ (a_{1}, \dots, a_{n}, \vec{y}) \Rightarrow A^{1} ⊧ \exists \vec{y} φ (a_{1}, \dots, a_{n}, \vec{y}) .

To this end, we will form a chain of models of T of power

ω

,

A = A_{0}^{1} ≺_{Π} A_{1}^{1} ≺_{Π} \dots ≺_{Π} A_{i}^{1} ≺_{Π} \dots,

and take

A^{1} = ⋃_{i \in ω} A_{i}^{1}

.

Let

{θ_{i} (a_{1}, \dots, a_{n_{i}}, \vec{y}) : i \in ω}

be an enumeration of all formulas in

Π

with parameters

a_{i}

from

A

.

$i = 0$ : Put $A_{0}^{1} = A$ .
$i \to i + 1$ : If $T + D_{Π} (A_{i}^{1}) + \exists \vec{y} θ_{i} (a_{1}, \dots, a_{n_{i}}, \vec{y})$ is consistent, there exists $D ⊧ T$ such that $A_{i}^{1} ≺_{Π} D$ and $D ⊧ \exists \vec{y} θ_{i} (a_{1}, \dots, a_{n_{i}}, \vec{y})$ . Define $A_{i + 1}^{1}$ to be $D$ . If $T + D_{Π} (A_{i}^{1}) + \exists \vec{y} θ_{i} (a_{1}, \dots, a_{n_{i}}, \vec{y})$ is inconsistent, define $A_{i + 1}^{1}$ to be $A_{i}^{1}$ .

Let us check that

A^{1} = ⋃_{i \in ω} A_{i}^{1}

satisfies the required properties.

$A ≺_{Π} A^{1}$ . As usual, by induction on the syntactical complexity of the formulas in $Π$ , it is easy to see that the union of the chain $A^{1}$ is a $Π$ -elementary extension of each model in the chain (here we use the assumption that a basic fragment $Π$ is closed under subformulas and term substitution).
$A^{1} ⊧ T$ . It follows from the fact that $\forall \exists B (Π)$ -axiomatizable theories are preserved under unions of $Π$ -elementary chains. In fact, let $\forall \vec{x} \exists \vec{y} θ (\vec{x}, \vec{y})$ be an axiom of T, where $θ (\vec{x}, \vec{y}) \in Π$ , and consider $a_{1}, \dots, a_{n} \in A^{1}$ . Pick $i_{0} \in ω$ such that $a_{1}, \dots, a_{n} \in A_{i_{0}}^{1}$ . Since $A_{i_{0}}^{1}$ is a model of T, there are $b_{1}, \dots, b_{m} \in A_{i_{0}}^{1}$ such that $A_{i_{0}}^{1} ⊧ θ (\vec{a}, \vec{b})$ . Since $A_{i_{0}}^{1} ≺_{Π} A^{1}$ and $θ (\vec{x}, \vec{y}) \in B (Π)$ , $A^{1} ⊧ θ (\vec{a}, \vec{b})$ . Hence, $A^{1} ⊧ \exists \vec{y} θ (\vec{a}, \vec{y})$ , as required.
Consider $C ⊧ T$ with $A^{1} ≺_{Π} C$ , $φ (x_{1}, \dots, x_{n}, \vec{y}) \in Π$ and $a_{1}, \dots, a_{n} \in A$ such that $C ⊧ \exists \vec{y} φ (a_{1}, \dots, a_{n}, \vec{y})$ . Pick $j \in ω$ such that $φ (a_{1}, \dots, a_{n}, \vec{y})$ is $θ_{j} (a_{1}, \dots, a_{n_{j}}, \vec{y})$ . Clearly, $T + D_{Π} (A_{j}^{1}) + \exists \vec{y} θ_{j} (a_{1}, \dots, a_{n_{j}}, \vec{y})$ is consistent and so $A_{j + 1}^{1} ⊧ \exists \vec{y} φ (a_{1}, \dots, a_{n}, \vec{y})$ by construction. But then $A^{1} ⊧ \exists \vec{y} φ (a_{1}, \dots, a_{n}, \vec{y})$ since $A_{j + 1}^{1} ≺_{Π} A^{1}$ .

Repeating the construction

ω

times, we obtain a chain of models of T:

A = A^{0} ≺_{Π} A^{1} ≺_{Π} A^{2} ≺_{Π} \dots

such that any

\exists Π

sentence with constants from

A^{i}

that holds in some extension of

A^{i + 1}

, which is a model of T, holds in

A^{i + 1}

as well. Take

B = ⋃_{i \in ω} A^{i}

. It is clear that

A ≺_{Π} B

and

B

is an

\exists Π

-closed model of T. □

Our basic device to prove the conservation results is the next lemma, which is a general version of Theorem 3.4 in [10].

Lemma 2.

Let T be a

\forall \exists B (Π)

-axiomatizable theory and let

T^{'}

be a theory such that every

\exists Π

-closed model of T is a model of

T^{'}

. Then,

T^{'}

is

\forall B (Π)

-conservative over T.

Proof.

Let

φ \in \forall B (Π)

be a sentence such that

T^{'} ⊢ φ

. We must show that

T ⊢ φ

.

Suppose that

T ⊬ φ

. Then, there exists

A ⊧ T + \neg φ

. Since

\neg φ

is an

\exists B (Π)

sentence,

T + \neg φ

is a

\forall \exists B (Π)

-axiomatized, consistent theory and, by Lemma 1 and Remark 2, there exists an

\exists Π

-closed model of T,

B

, such that

A ≺_{B (Π)} B

. Firstly, by the assumption on the theory

T^{'}

,

B

is a model of

T^{'}

. Secondly, put

\neg φ \equiv \exists \vec{y} φ_{0} (\vec{y})

, with

φ_{0} (\vec{y}) \in B (Π)

, and pick

\vec{a} \in A

, satisfying that

A ⊧ φ_{0} (\vec{a})

. Since

A ≺_{B (Π)} B

,

B ⊧ φ_{0} (\vec{a})

, and so

B ⊧ \exists \vec{y} φ_{0} (\vec{y})

. Then,

B ⊧ T^{'} + \neg φ

, a contradiction. □

In order to apply Lemma 2, we will need the following result that mirrors theorem 3.3 outlined in [10]. It establishes a connection between validity within an

\exists Π

-closed model of T and the provability within the theory T itself.

Proposition 1.

Let

A

be an

\exists Π

-closed model of T and

φ (\vec{x}) \in \exists \forall \neg Π

,

\vec{a} \in A

such that

A ⊧ φ (\vec{a})

. Then, there exist

\vec{c} \in A

and

θ_{1} (\vec{x}, \vec{z}), \dots, θ_{k} (\vec{x}, \vec{z}) \in Π \cup \neg Π

such that

A ⊧ θ_{1} (\vec{a}, \vec{c}) \land \dots \land θ_{k} (\vec{a}, \vec{c}) a n d T ⊢ θ_{1} (\vec{x}, \vec{z}) \land \dots \land θ_{k} (\vec{x}, \vec{z}) \to φ (\vec{x}) .

Proof.

Since

A ⊧ φ (\vec{a})

and

φ (\vec{x}) \in \exists \forall \neg Π

, there are

φ_{0} (\vec{x}, \vec{v}) \in \forall \neg Π

and

\vec{d} \in A

such that

φ (\vec{x})

is

\exists \vec{v} φ_{0} (\vec{x}, \vec{v})

and

A ⊧ φ_{0} (\vec{a}, \vec{d})

. Since

A

is

\exists Π

-closed,

T + D_{Π} (A) + \neg φ_{0} (\vec{a}, \vec{d})

is inconsistent. Indeed, assume that

T + D_{Π} (A) + \neg φ_{0} (\vec{a}, \vec{d})

is consistent. By Remark 1, there exists

B ⊧ T + \neg φ_{0} (\vec{a}, \vec{d})

such that

A ≺_{Π} B

. It follows from the fact that

A

is an

\exists Π

-closed model of T that

A ≺_{\exists Π} B

. But then

A ⊧ \neg φ_{0} (\vec{a}, \vec{d})

since

\neg φ_{0} (\vec{x}, \vec{v}) \in \exists Π

, which contradicts the fact that

A ⊧ φ_{0} (\vec{a}, \vec{d})

.

Therefore,

T + D_{Π} (A) ⊢ φ_{0} (\vec{a}, \vec{d}) .

In particular,

T + D_{Π} (A) ⊢ \exists \vec{v} φ_{0} (\vec{a}, \vec{v})

and hence

T + D_{Π} (A) ⊢ φ (\vec{a})

. By compactness, there exist

θ_{1} (\vec{a}, \vec{c}), \dots, θ_{k} (\vec{a}, \vec{c}) \in D_{Π} (A)

such that

T + θ_{1} (\vec{a}, \vec{c}) + \dots + θ_{k} (\vec{a}, \vec{c}) ⊢ φ (\vec{a})

. Since the constant symbols

\vec{a}, \vec{c}

do not appear in the language of the theory T, we obtain

T ⊢ θ_{1} (\vec{x}, \vec{z}) \land \dots \land θ_{k} (\vec{x}, \vec{z}) \to φ (\vec{x}),

as required. □

4. Normal Conditional Axioms

After introducing the basic model–theoretic machinery in the previous section, we are now ready to establish our first general conservation theorem between axioms and inference rules. To this end, we first need the following simple yet useful lemma.

Lemma 3.

Let T be a theory and let E be a set of conditional axioms. Then, for every basic fragment Π,

E^{Π} – R u l e

and

E^{B (Π)} – R u l e

are congruent modulo T.

Proof.

Since

Π \subseteq B (Π)

, it is enough to show that for every theory U extending T,

[U, E^{Π} – Rule]

extends

[U, E^{B (Π)} – Rule]

. Even more, since

B (Π)

is closed under conjunctions and negation, it is enough to show that for all

α (\vec{v}) \to β (\vec{v}) \in E

and

σ (\vec{u}, \vec{v}) \in B (Π)

, if

U ⊢ σ (\vec{u}, \vec{v}) \to α (\vec{v})

, then

[U, E^{Π} – Rule] ⊢ σ (\vec{u}, \vec{v}) \to β (\vec{v}) .

Given

σ (\vec{u}, \vec{v}) \in B (Π)

, there are

σ_{i j} (\vec{u}, \vec{v}) \in Π \cup \neg Π

such that

σ (\vec{u}, \vec{v}) \equiv ⋁_{i = 1}^{n} ⋀_{j = 1}^{m_{i}} σ_{i, j} (\vec{u}, \vec{v}) .

Then, since

U ⊢ σ (\vec{u}, \vec{v}) \to α (\vec{v})

, we have

U ⊢ ⋀_{i = 1}^{n} (⋀_{j = 1}^{m_{i}} σ_{i j} (\vec{u}, \vec{v}) \to α (\vec{v})) .

Then, n (unnested) applications of

E^{Π} – Rule

give us

[U, E^{Π} – Rule] ⊢ ⋀_{i = 1}^{n} (⋀_{j = 1}^{m_{i}} σ_{i j} (\vec{u}, \vec{v}) \to β (\vec{v}))

and, therefore,

[U, E^{Π} – Rule] ⊢ ⋁_{i = 1}^{n} ⋀_{j = 1}^{m_{i}} σ_{i j} (\vec{u}, \vec{v}) \to β (\vec{v}),

as required. □

The next Proposition establishes a general conservation theorem between a base theory T augmented with a set of conditional axioms E and the associated inference rule theory

T + E^{Π} – Rule

, where

Π

is an appropriate basic fragment. Namely, under very general conditions, it is possible to replace the use of an axiom from E by the use of an inference rule at the price of adding certain side formulas from the class

Π

during the application of the inference rule.

Proposition 2.

Let T be a theory, let Π be a basic fragment, and let E be a set of conditional axioms such that

(S1) For every $α (\vec{v}) \to β (\vec{v}) \in E$ , $α (\vec{v})$ is T-provably equivalent to an $\exists \forall B (Π)$ formula; and
(S2) $T + E^{Π} – Rule$ is $\forall \exists B (Π)$ -axiomatizable.

Then,

T + E

is

\forall B (Π)

-conservative over

T + E^{Π} – Rule

.

Proof.

By Lemma 3,

E^{Π}

–Rule and

E^{B (Π)}

–Rule are congruent and hence it is sufficient to show that

T + E

is

\forall B (Π)

-conservative over

T + E^{B (Π)}

–Rule. Note that

B (Π)

is also a basic fragment and that

B (B (Π)) = B (Π)

. By condition (

S 2

)

T + E^{B (Π)}

–Rule is

\forall \exists B (Π)

-axiomatizable. Hence, by Lemma 2 for the basic fragment

B (Π)

, it suffices to prove that every

\exists B (Π)

-closed model of

T + E^{B (Π)}

–Rule is a model of

T + E

.

Let

A

be an

\exists B (Π)

-closed model of

T + E^{B (Π)}

–Rule. Consider

α (\vec{v}) \to β (\vec{v}) \in E

and

\vec{a} \in A

such that

A ⊧ α (\vec{a})

. We must show that

A ⊧ β (\vec{a})

.

By condition (

S 1

), there exists

α_{0} (\vec{v}) \in \exists \forall B (Π)

such that

T ⊢ α (\vec{v}) \leftrightarrow α_{0} (\vec{v})

and so

A ⊧ α_{0} (\vec{a})

. By Proposition 1 for the basic fragment

B (Π)

, there exist

θ (\vec{v}, \vec{z}) \in B (Π)

and

\vec{c} \in A

, satisfying

A ⊧ θ (\vec{a}, \vec{c})

and

T + E^{B (Π)} – Rule ⊢ θ (\vec{v}, \vec{z}) \to α_{0} (\vec{v}) .

Then,

T + E^{B (Π)} – Rule ⊢ θ (\vec{v}, \vec{z}) \to α (\vec{v})

and, by an application of

E^{B (Π)} – Rule

, we obtain

T + E^{B (Π)} – Rule ⊢ θ (\vec{v}, \vec{z}) \to β (\vec{v}) .

Therefore,

A ⊧ β (\vec{a})

since

A ⊧ T + E^{B (Π)}

–Rule and

A ⊧ θ (\vec{a}, \vec{c})

. □

Please note that conditions (S1) and (S2) of the above Proposition are satisfied by every theory and every set of conditional axioms with suitable syntactical complexity. Namely, if T is a

\forall \exists B (Π)

-axiomatizable theory and E is a set of conditional axioms such that for all

α (\vec{v}) \to β (\vec{v}) \in E

we have

α (\vec{v}) \in \exists \forall B (Π)

and

β (\vec{v}) \in \forall \exists B (Π)

, then T and E satisfy (S1) and (S2). According to this, in what follows we focus on

\forall \exists B (Π)

-axiomatizable theories and sets of conditional axioms of restricted syntactical complexity. This motivates the following definitions:

Definition 6.

A formula

α (\vec{v}) \to β (\vec{v})

is a normal conditional axiom with respect to Π if (modulo logical equivalence)

α (\vec{v}) \in \forall B (Π)

and

β (\vec{v}) \in \forall \exists B (Π)

.

If, instead, for some theory T,

α (\vec{v})

is T-provably equivalent to a

\forall B (Π)

formula and

β (\vec{v})

is T-provably equivalent to a

\forall \exists B (Π)

formula, then we say that

α (\vec{v}) \to β (\vec{v})

is a normal conditional axiom with respect to Π over T.

Remark 3.

In the context of formal arithmetic, there are a good number of combinatorial or logical principles that can be naturally expressed as a set of normal conditional axioms with respect to a suitable basic fragment Π. For instance, the induction principle and the collection or Replacement principles are prominent examples.

In a non-arithmetical context, an interesting example of normal conditional axioms could be the geometric ones (cf. [25]). A geometric axiom is a formula following the geometric axiom scheme below:

\forall \vec{x} (P_{1} (\vec{x}) \land \dots \land P_{n} (\vec{x}) \to \exists {\vec{y}}_{1} M_{1} (\vec{x}, {\vec{y}}_{1}) \lor \dots \lor \exists {\vec{y}}_{m} M_{m} (\vec{x}, {\vec{y}}_{m}))

where each

P_{j}

is an atom and each

M_{i}

is a conjunction of a list of atoms

Q_{i_{1}}, \dots, Q_{i_{ℓ}}

, and none of the variables in any

{\vec{y}}_{i}

are free in the

P_{j}

’s.

It is easy to check that a set of geometric axioms, E, is a set of normal conditional axioms with respect to the basic fragment consisting of the atomic formulas of the language, At.

In view of the discussion preceding Definition 6 and as an immediate corollary of Proposition 2, the following result is obtained.

Corollary 1.

Let Π be a basic fragment and let T be a

\forall \exists B (Π)

-axiomatizable theory. Let E be a set of normal conditional axioms with respect to Π over T. Then,

T + E

is

\forall B (Π)

-conservative over

T + E^{Π}

–Rule.

The previous corollary establishes a broadly applicable conservation result between a set of conditional axioms

T + E

and the naturally associated inference rule

T + E^{Π}

–Rule. Remarkably, in Corollary 1, we only imposed certain syntactical conditions on the quantifier complexity of the involved theories. Therefore, this conservation phenomenon remains independent of the specific combinatorial or mathematical principles that the set of conditional axioms E could express.

Remark 4.

It is important to notice that these conservation results are properties of the given axiomatizations of the theories

T + E

. That is, there are sets of conditional axioms

E_{1}

and

E_{2}

such that

T + E_{1}

and

T + E_{2}

are equivalent theories but the associated inference rules

E^{Π_{1}} – Rule

and

E^{Π_{2}} – Rule

significantly differ in strength.

A set of geometric axioms E can be used to illuminate this aspect of the approach developed in this work. Let us observe that if E is a set of geometric axioms, then each element in E is a conditional axiom

α (\vec{v}) \to β (\vec{v}),

where

α (\vec{v})

is a conjunction of atomic formulas. As a consequence,

\frac{\forall \vec{v} (α (\vec{v}) \to α (\vec{v}))}{\forall \vec{v} (α (\vec{v}) \to β (\vec{v}))}

is an instance of

E^{Π} – Rule

(with

Π = A t

) and it follows that, for every theory T,

[T, E^{Π} – Rule]

is equivalent to

T + E

, rendering trivial every conservation result between both theories. However, let us observe that if we put

D = {\neg β (\vec{v}) \to \neg α (\vec{v}) : α (\vec{v}) \to β (\vec{v}) \in E}

then

T + E \equiv T + D

and D is also a set of normal conditional axioms with respect to Π.

By Proposition 2,

T + E

is

\forall B (Π)

-conservative over

T + D^{Π} – Rule

, but now

T + D^{Π} – Rule

is a theory presumably weaker than

T + E

(since the applications of rule

D^{Π} – Rule

only produce

\forall B (Π)

formulas).

Moreover, Corollary 1 suggests a natural scenario in which the conservativity of

T + E

over the directly associated inference rule

T + E

–Rule can be established, namely when E–Rule and

E^{Π}

–Rule are shown to be equivalent rules. This prompts the following definition.

Definition 7.

Let T be a theory and let E be a set of conditional axioms. We say that

E is weakly $Π$ -reducible modulo T if $E^{Π} – Rule$ is derivable from $E – Rule$ modulo T.
E is $Π$ -reducible modulo T if $E^{Π} – Rule$ is reducible to $E – Rule$ modulo T.

Paradigmatic examples of

Π

-reducible sets of conditional axioms are provided by the different versions of the induction principle in first-order arithmetics, usually formulated by means of a scheme. In particular, the open induction scheme gives us a simple but very clear example that was already studied in the early 1960s by Shepherdson (see [14]). Let us consider a first-order language L extending the language of first-order arithmetics. Let T be a theory in the language L axiomatized by

\forall B (Π)

sentences and let E be the set of conditional sentences generated by the scheme

φ (0, \vec{v}) \land \forall x (φ (x, \vec{v}) \to φ (x + 1, \vec{v})) \to \forall x φ (x, \vec{v})

where

φ (x, \vec{v})

varies within the set

\forall_{0}

of all open formulas of L. Then,

$(★)$ E is $\forall_{0}$ -reducible modulo T.

To see this, it is enough to notice that, given

φ (x, \vec{v}), θ (\vec{v}, \vec{z}) \in \forall_{0}

such that, for some extension U of T,

U ⊢ \forall \vec{v} \forall \vec{z} (θ (\vec{v}, \vec{z}) \to φ (0, \vec{v}) \land \forall x (φ (x, \vec{v}) \to φ (x + 1, \vec{v})))

then the sentence

\forall \vec{v} \forall \vec{z} (θ (\vec{v}, \vec{z}) \to \forall x φ (x, \vec{v}))

can be derived in

[U, E – Rule]

from a single instance of

E – Rule

as follows:

Let

ψ (x, \vec{v}, \vec{z})

be the formula

θ (\vec{v}, \vec{z}) \to φ (x, \vec{v})

. Then,

U ⊢ \forall \vec{v} \forall \vec{z} (ψ (0, \vec{v}) \land \forall x (ψ (x, \vec{v}) \to ψ (x + 1, \vec{v})))

and, therefore,

[U, E – Rule] ⊢ \forall \vec{v} \forall \vec{z} \forall x ψ (x, \vec{v})

, but this last sentence is easily seen to be equivalent to

\forall \vec{v} \forall \vec{z} (θ (\vec{v}, \vec{z}) \to \forall x φ (x, \vec{v}))

.

In [14], the theory

T + E

was denoted by IAO and

T + E – Rule

by RIO. Bearing in mind Corollary 1 and

(★)

, we obtain an alternative proof of Theorem 2.2 in [14] stating that IAO is

\forall_{1}

-conservative over RIO.

Now we are ready to prove our first general conservation theorem of a theory from

T + E

over the associated inference rule theory

T + E

–Rule. As a by-product, we will also obtain the conservativity of

T + E

over a certain parameter-restricted version of that theory. In fact, given a set of conditional axioms E, we define the set of sentences

U E = {\forall \vec{v} α (\vec{v}) \to \forall \vec{v} β (\vec{v}) : α (\vec{v}) \to β (\vec{v}) \in E}

(U stands for uniform, for in order to apply an axiom of

U E

, the antecedent

α (\vec{v})

must be uniformly true, i.e., true for all values of the parameters

\vec{v}

). It is clear that

T + E

implies

T + U E

, which, in turn, implies

T + E

–Rule

Theorem 1.

Let T be a

\forall \exists B (Π)

-axiomatizable theory and let E be a set of normal conditional axioms with respect to Π over T. If E is weakly Π-reducible modulo T, then

1.: $T + E$ is $\forall B (Π)$ -conservative over $T + E – Rule$ .
2.: $T + E$ is $\exists \forall B (Π)$ -conservative over $T + U E$ .
3.: In fact, if a theory D satisfies that every extension of $T + D$ is closed under $E – Rule$ , then $T + E$ is $\exists \forall B (Π)$ -conservative over $T + D$ .

Proof.

Part (1) directly follows from Proposition 2. Note that conditions (

S 1

) and (

S 2

) in the statement of Proposition 2 are satisfied since E is a set of normal conditional axioms with respect to

Π

over T and

T + E

–Rule and

T + E^{Π}

–Rule are equivalent since E is assumed to be weakly

Π

-reducible modulo T.

Let us prove part (2). Let

ψ \in \exists \forall B (Π)

be a sentence such that

T + U E ⊬ ψ

. Then,

T^{'} = T + U E + \neg ψ

is consistent and

\forall \exists B (Π)

-axiomatizable; hence, by Lemma 1, there exists an

\exists B (Π)

-closed model of

T^{'}

, say

A

.

Observe that

T^{'}

is closed under E–Rule. By weak

Π

-reducibility modulo T and Lemma 3,

T^{'}

is also closed under

E^{B (Π)}

–Rule. Hence, reasoning as in the proof of Proposition 2, we obtain that

A ⊧ T^{'} + E

. In particular,

A ⊧ T + E + \neg ψ

and so

T + E ⊬ ψ

.

As for part (3), let us observe that

T + D

extends

T + U E

(for otherwise there would be

α (\vec{v}) \to β (\vec{v}) \in E

such that

T + D + \forall \vec{v} α (\vec{v}) ⊬ \forall \vec{v} β (\vec{v})

and so

T + D + \forall \vec{v} α (\vec{v})

would not be closed under E-Rule, contradicting the hypothesis on D). Hence, part (3) follows from part (2). □

Remark 5.

Theorem 1 provides a general method for proving the conservativity of a set of axioms E over the associated inference rule E–Rule: (i) expressing E as a set of normal conditional axioms with respect to an appropriate basic fragment Π, and (ii) showing that E is weakly Π-reducible (i.e.,

E^{Π}

–Rule is derivable from E–Rule).

In the realm of arithmetic, significant results can be derived from this approach. For instance, it can be readily demonstrated that the theory of

Σ_{1}

-induction

I Σ_{1}

can be formulated as a set of normal conditional axioms with respect to the basic fragment

Π_{1}

. Subsequently, the resulting

E^{Π_{1}}

–Rule can be derived from (or, more precisely, reduced to)

Σ_{1}

–IR modulo

I Δ_{0}

. This leads to a proof of the well-known fact regarding the

Π_{2}

conservativity of

I Σ_{1}

over

I Δ_{0} + Σ_{1} – IR

. Through this approach, numerous other significant conservation results for arithmetic theories can be proved.

In the setting of formal number theory, a natural question regarding the proof strength of an arithmetic theory T is to characterize the

Γ

consequences of the theory; that is, the set of all theorems of T of a fixed quantifier complexity

Γ

. To fix notation, given a theory T and a set of formulas

Γ

in the language of T, we denote

T h_{Γ} (T) = {φ \in Γ : φ is a sentence and T ⊢ φ} .

Two prototypical results in this regard are the well-known facts that

I Δ_{0} + Σ_{1} – IR

characterizes the

Π_{2}

consequences of

Σ_{1}

-induction

T h_{Π_{2}} (I Σ_{1})

and that

U I Σ_{1}

characterizes

T h_{Σ_{3}} (I Σ_{1})

.

In [17], Kaye already observed that these fundamental facts can be extended to a broader, arithmetic-free context, and that they can be established by using simple model–theoretic arguments. Our Theorem 1 provides an alternative proof of Kaye’s observation. Specifically, suppose that T is a

\forall B (Π)

-axiomatizable theory and that E is a set of conditional axioms satisfying that if

α (\vec{v}) \to β (\vec{v}) \in E

, then both

α (\vec{v})

and

β (\vec{v})

are in

\forall B (Π)

(possibly modulo T). Then,

T + E^{Π}

–Rule is

\forall B (Π)

-axiomatizable and

T + U E

is

\exists \forall B (Π)

-axiomatizable. Therefore, if E satisfies the assumptions of Theorem 1, we obtain characterizations of

T h_{\forall B (Π)} (T)

and

T h_{\exists \forall B (Π)} (T)

. Namely,

T + E^{Π}

–Rule captures, precisely, the

\forall B (Π)

consequences of

T + E

, and

T + U E

captures, precisely, the

\exists \forall B (Π)

consequences of

T + E

.

To close this section, we show how to derive from Theorem 1 another result of Kaye regarding general L-theories. Namely, Theorem 1.1. of [17] establishes that if T is any

\forall_{n + 1}

-axiomatizable L-theory (

n \geq 1

), then

T h_{\exists_{n + 1}} (T) \equiv T h_{B (\exists_{n})} (T) .

Here we obtain a slightly more general version of this result: if

Π

is a basic fragment of a language L and T is a

\forall \exists B (Π)

-axiomatizable L-theory, then

T h_{\exists \forall B (Π)} (T)

and

T h_{B (\exists B (Π))} (T)

coincide (note that Kaye’s result can be recovered by taking

Π = \forall_{n - 1}

).

Theorem 2.

If T is a

\forall \exists B (Π)

-axiomatizable theory, then

T h_{\exists \forall B (Π)} (T) \equiv T h_{B (\exists B (Π))} (T) .

Proof.

Without a loss of generality, it suffices to prove the result under the assumption that the basic fragment

Π

is closed under boolean combinations. Assume

Π = B (Π)

and let T be any

\forall \exists Π

-axiomatized theory. We must show that

T h_{\exists \forall Π} (T)

and

T h_{B (\exists Π)} (T)

are equivalent theories.

First of all, observe that T can be axiomatized by a set of conditional axioms as follows. Let

T_{0}

denote the theory in the language of T with no non-logical axioms and define

R_{T} = {\forall \vec{y} \neg φ (\vec{x}, \vec{y}) \to ⊥ : \forall \vec{x} \exists \vec{y} φ (\vec{x}, \vec{y}) \in T, φ (\vec{x}, \vec{y}) \in Π},

where

⊥ \in \forall Π

denotes the (false) sentence

\forall x (x \neq x)

. It is clear that

T \equiv T_{0} + R_{T}

. Now consider the new set of conditional axioms

E = {(R_{T})}^{Π}

; that is, E is the set of conditional axioms given by

(θ (\vec{x}, \vec{z}) \to \forall \vec{y} \neg φ (\vec{x}, \vec{y})) \to (θ (\vec{x}, \vec{z}) \to ⊥),

where

θ (\vec{x}, \vec{z}) \in Π

,

\forall \vec{x} \exists \vec{y} φ (\vec{x}, \vec{y}) \in T

, and

φ (\vec{x}, \vec{y}) \in Π

. We also have

T \equiv T_{0} + E

and now it is immediate to verify that E is a set of normal conditional axioms with respect to

Π

, which, in addition, is

Π

-reducible modulo

T_{0}

. By Theorem 1, we obtain that T is

\exists \forall Π

-conservative over

T_{0} + U E

, which, by definition, is given by the set of sentences

\forall \vec{x} \forall \vec{z} (θ (\vec{x}, \vec{z}) \to \forall \vec{y} \neg φ (\vec{x}, \vec{y})) \to \forall \vec{x} \forall \vec{z} (θ (\vec{x}, \vec{z}) \to ⊥),

where

θ (\vec{x}, \vec{z}) \in Π

,

\forall \vec{x} \exists \vec{y} φ (\vec{x}, \vec{y}) \in T

, and

φ (\vec{x}, \vec{y}) \in Π

. But it is straightforward to check that

U E

can be rewritten as a set of sentences which, modulo logical equivalence, are in

B (\exists Π)

. Namely,

\exists \vec{x} \exists \vec{z} θ (\vec{x}, \vec{z}) \to \exists \vec{x} \exists \vec{z} \exists \vec{y} (θ (\vec{x}, \vec{z}) \land φ (\vec{x}, \vec{y})),

where

θ (\vec{x}, \vec{z}) \in Π

,

\forall \vec{x} \exists \vec{y} φ (\vec{x}, \vec{y}) \in T

, and

φ (\vec{x}, \vec{y}) \in Π

. Consequently,

T h_{B (\exists Π)} (T)

implies

T_{0} + U E

, which, in turn, implies

T h_{\exists \forall Π} (T)

by conservativity. For the opposite direction, observe that (modulo logical equivalence) every

B (\exists Π)

sentence can be rewritten as an

\exists \forall Π

sentence. □

5. Finite Sets of Conditional Sentences

In the previous section, we obtained a number of conservation theorems of

T + E

over

T + E – Rule

or

T + E^{Π} – Rule

for a general set of normal conditional axioms E. In this section, we will prove finer conservation results for the particular case where E is a finite set of normal sentences. In other words, we are interested in cases where E can be expressed as

{α_{1} \to β_{1}, \dots, α_{m} \to β_{m}},

where

m \in ω

and all

α_{i}

s and

β_{i}

s are sentences, i.e., they contain no free variables.

Again, the original motivation for considering these particular sets of conditional axioms comes from results in the context of formal arithmetics. A well-studied fragment of first-order Peano arithmetic is the scheme of parameter-free

Σ_{1}

-induction

I Σ_{1}^{-}

given by a basic algebraic theory

P^{-}

together with

I_{φ} : φ (0) \land \forall x (φ (x) \to φ (x + 1)) \to \forall x φ (x),

where

φ (x) \in Σ_{1}^{-}

; that is,

φ (x) \in Σ_{1}

and contains no other free variables than the induction variable x. Note that

I Σ_{1}^{-}

can be seen as a set of normal conditional sentences with respect to

Π = Π_{1}

. It is well-known that

I Σ_{1}

and its parameter-free counterpart

I Σ_{1}^{-}

share the same

Π_{2}

consequences (indeed, the

Σ_{3}

consequences are also preserved), but

I Σ_{1}^{-}

enjoys the following nice property:

Let $θ$ be a $Π_{2}$ sentence. If for some $φ_{1} (x), \dots, φ_{m} (x) \in Σ_{1}^{-}$ we have $I Δ_{0} + I_{φ_{1}} + \dots + I_{φ_{m}} ⊢ θ$ , then ${[I Δ_{0}, Σ_{1} – IR]}_{m} ⊢ θ$ .

The previous property is a well-known conservation theorem for fragments of arithmetic obtained (independently) by Z. Adamowicz, T. Bigorajska, G. Mints, and also by Z. Ratajczyk. The result generalizes to

I Σ_{n}^{-}

for each

n \geq 1

, but we cannot expect to have a similar result for (parametric)

I Σ_{1}

since

I Σ_{1}

is well-known to be finitely axiomatizable.

At first sight, the previous result for

I Σ_{1}^{-}

could seem to be a very particular property of the induction scheme in the formal arithmetic setting. However, and this was already observed by Kaye in [17], this property again corresponds to a very general purely logical fact for theories described in terms of conditional sentences, see theorem 1.4 in [17] (let us observe that similar results in the context of bounded arithmetic theories have been obtained by Jeřábek in [26].)

In the present section, we shall prove a (slightly more general) version of theorem 1.4 in [17] using our methodology. Namely, we shall obtain a conservation theorem relating the number of conditional sentences needed to derive a

\forall B (Π)

formula from E and the depth of the nested applications of the corresponding

E^{Π} – Rule

, see Theorem 3 below.

Through this section,

Π

will denote an arbitrary basic fragment. We shall begin with an analysis of

E^{Π} – Rule

when E is a set of conditional sentences but E is not necessarily a finite set. Let us observe that, since E is a set of sentences and, by Lemma 3,

E^{Π} – Rule

is congruent with

E^{B (Π)} – Rule

, it is straightforward to check that

E^{Π} – Rule

is congruent with the following rule (that we shall denote by

E_{\forall}^{Π} – Rule

):

\frac{θ \to α}{θ \to β} (for each sentence θ \in \exists B (Π) and α \to β \in E) .

This motivates the introduction of a kind of dual version of

E^{Π} – Rule

that we shall denote by

E_{\exists}^{Π} – Rule

. Instances of this new rule are

\frac{θ \to α}{θ \to β} (for each sentence θ \in \forall B (Π) and α \to β \in E) .

Please notice that the subscript ∃ in the name of the

E_{\exists}^{Π}

–Rule indicates that, as shown in Theorem 3 below, under certain assumptions,

T + E

is

\exists B (Π)

-conservative over applications of this rule. Similarly, the subscript ∀ in the name of the

E_{\forall}^{Π}

–Rule indicates that, under certain assumptions,

T + E

is

\forall B (Π)

-conservative over applications of

E_{\forall}^{Π}

–Rule.

Lemma 4.

Let T be a theory and let E be a set of conditional sentences.

1.: For every sentence $σ \in \exists B (Π)$ , $[T + σ, E^{Π} – Rule] \equiv [T, E^{Π} – Rule] + σ$ .
2.: For every sentence $τ \in \forall B (Π)$ , $[T + τ, E_{\exists}^{Π} – Rule] \equiv [T, E_{\exists}^{Π} – Rule] + τ .$

Proof.

(1) We only prove that, for all sentences

σ \in \exists B (Π)

,

[T, E^{Π} – Rule] + σ

extends

[T + σ, E^{Π} – Rule]

(the opposite direction is trivial).

Let

θ (\vec{u}) \in {(Π \cup \neg Π)}^{\land}

be such that

T + σ ⊢ \forall \vec{u} (θ (\vec{u}) \to α)

, for some sentence

α \to β \in E

. Since

σ

is

\exists \vec{y} σ_{0} (\vec{y})

for some

σ_{0} (\vec{y}) \in B (Π)

(and we can assume that the variables in

\vec{y}

are all different from the ones in

\vec{u}

), we obtain

T ⊢ \forall \vec{y} \forall \vec{u} (σ_{0} (\vec{y}) \land θ (\vec{u}) \to α) .

But recall that, by Lemma 3,

E^{Π} – Rule

is congruent with

E^{B (Π)} – Rule

, and therefore it follows that

[T, E^{Π} – Rule] ⊢ \forall \vec{y} \forall \vec{u} (σ_{0} (\vec{y}) \land θ (\vec{u}) \to β)

. Thus,

[T, E^{Π} – Rule] ⊢ σ \to \forall \vec{u} (θ (\vec{u}) \to β)

and the result follows.

(2) We prove that, given

τ \in \forall B (Π)

,

[T, E_{\exists}^{Π} – Rule] + τ

extends

[T + τ, E_{\exists}^{Π} – Rule]

.

Let

θ \in \forall B (Π)

be a sentence such that

T + τ ⊢ θ \to α

for some

α \to β \in E

. Since

θ

and

τ

are (respectively)

\forall \vec{u} θ_{0} (\vec{u})

and

\forall \vec{y} τ_{0} (\vec{y})

for some

τ_{0} (\vec{y}), θ_{0} (\vec{u}) \in B (Π)

(and we can assume that the variables in

\vec{y}

are all different from the ones in

\vec{u}

), we obtain

T ⊢ \forall \vec{y} \forall \vec{u} (τ_{0} (\vec{y}) \land θ_{0} (\vec{u})) \to α .

So,

[T, E_{\exists}^{Π} – Rule] ⊢ τ \to (θ \to β)

, and the result follows. □

The first interesting fact about sets of conditional sentences is the following improvement of Proposition 2.

Lemma 5.

Let E be a set of conditional sentences such that if

α \to β \in E

, then

α \in B (\forall B (Π))

. Then, for every theory T,

T + E

is

\forall B (Π)

-conservative over

T + E^{Π} – Rule

.

Proof.

Let

T_{1}

denote the theory axiomatized by

T h_{\forall B (Π)} (T + E^{Π} – Rule)

. We shall prove that every

\exists B (Π)

-closed model of

T_{1}

is a model of

T h_{\forall B (Π)} (T + E)

. Thus, the result will follow from Lemma 2.

Let

A

be an

\exists B (Π)

-closed model of

T_{1}

. First of all, note that

(•) $(T + E^{Π} – Rule) + D_{B (Π)} (A)$ is consistent.
Proof of (•): For otherwise by compactness, there would exist $δ (\vec{v}) \in B (Π)$ and $\vec{a} \in A$ satisfying $A ⊧ δ (\vec{a})$ and $T + E^{Π} – Rule ⊢ \forall \vec{v} \neg δ (\vec{v})$ . Since $\forall \vec{v} \neg δ (\vec{v}) \in \forall B (Π)$ , we would have $A ⊧ \forall \vec{v} \neg δ (\vec{v})$ , a contradiction.

Hence, there exists

B ⊧ T + E^{Π} – Rule

such that

A ≺_{B (Π)} B

. Let us show that

B ⊧ E

.

Pick

α \to β \in E

such that

B ⊧ α

. It follows from

A ≺_{B (Π)} B

and the fact that

A

is an

\exists B (Π)

-closed model of

T_{1}

that

A ≺_{\exists B (Π)} B

. Since

α \in B (\forall B (Π))

and

A ≺_{\exists B (Π)} B

, we also have

A ⊧ α

. But note that every

B (\forall B (Π))

sentence can be rewritten (modulo logical equivalence) as an

\exists \forall B (Π)

sentence.

Therefore, by applying Proposition 1, we obtain that there exist

\vec{a} \in A

and

θ (\vec{z}) \in B (Π)

such that

A ⊧ θ (\vec{a})

and

T_{1} ⊢ \exists \vec{z} θ (\vec{z}) \to α

. Then,

T + E^{Π} – Rule ⊢ \exists \vec{z} θ (\vec{z}) \to β

(recall that

E^{Π}

and

E^{B (Π)}

are congruent rules) and so

B ⊧ \exists \vec{z} θ (\vec{z}) \to β

. But,

B ⊧ \exists \vec{z} θ (\vec{z})

since

A ≺_{B (Π)} B

. Therefore,

B ⊧ β

, as required.

We have thus shown that there is

B ⊧ T + E

such that

A ≺_{B (Π)} B

and, therefore,

A ⊧ T h_{\forall B (Π)} (T + E)

, as required. □

We now turn to the study of the case where E is a finite set of conditional sentences. First, we introduce some notation. If

E = {ψ}

, where

ψ

is a conditional sentence, then

E^{Π} – Rule

will be denoted by

ψ^{Π} – Rule

and

E_{\exists}^{Π} – Rule

will be denoted by

ψ_{\exists}^{Π} – Rule

. The next lemma shows that for every conditional sentence

ψ

, nested applications of

ψ^{Π} – Rule

(or

ψ_{\exists}^{Π} – Rule

) collapse to unnested applications of the rule.

Lemma 6.

Assume ψ is a conditional sentence of the form

α \to β

.

1.: If $α \in \forall B (Π)$ , then $[T, ψ^{Π} – Rule] \equiv T + ψ^{Π} – Rule$ .
2.: If $α \in \exists B (Π)$ , then $[T, ψ_{\exists}^{Π} – Rule] \equiv T + ψ_{\exists}^{Π} – Rule .$

Proof.

(1): By Lemma 3,

ψ^{Π} – Rule

and

ψ^{B (Π)} – Rule

are congruent, and thus we can assume, without loss of generality, that

Π

is closed under boolean combinations (that is,

Π = B (Π)

). In addition, we also take advantage of the fact that, since we are dealing with conditional sentences, each instance of

ψ^{Π} – Rule

can be easily transformed into an equivalent instance of

ψ_{\forall}^{Π}

. Thus, in the following proofs, we shall deal with instances of

ψ_{\forall}^{Π} – Rule

, although we refer to them as instances of

ψ^{Π} – Rule

.

First, we show that

k \geq 1

unnested applications of

ψ^{Π} – Rule

can be replaced by a single unnested application of this rule: Let

θ

be an

\exists Π

sentence equivalent to

⋁_{i = 1}^{k} θ_{i}

, where

θ_{1}, \dots, θ_{k} \in \exists Π

are sentences such that for each

i = 1, \dots, k

, we have

T ⊢ θ_{i} \to α

. Then,

T ⊢ θ \to α

, and as a consequence, since for each

i = 1, \dots, k

,

T ⊢ (θ \to β) \to (θ_{i} \to β),

we obtain that these k applications of

ψ^{Π} – Rule

corresponding to

θ_{1}, \dots, θ_{k}

can be replaced by the instance given by the sentence

θ

.

Now we show how to deal with nested applications of

ψ^{Π} – Rule

. Let

θ_{1}, θ_{2} \in \exists Π

be sentences such that

T ⊢ θ_{1} \to α, and T + (θ_{1} \to β) ⊢ θ_{2} \to α .

Let

θ \in \exists Π

be a sentence equivalent to

θ_{1} \lor θ_{2}

. Let us see that

T ⊢ θ \to α

.

Indeed, we argue using T and assume that

θ

holds. Firstly, if

θ_{1}

holds, then we obtain

α

since

T ⊢ θ_{1} \to α

by our hypothesis. Secondly, if

θ_{1}

does not hold, then we have

θ_{1} \to β

and

θ_{2}

. But, since by hypothesis

T + (θ_{1} \to β) ⊢ θ_{2} \to α

, it follows that

θ_{2} \to α

. As a consequence, we obtain

α

again, as required.

We showed that

T ⊢ θ \to α

and therefore one application of

ψ^{Π} – Rule

is enough to derive

θ \to β

, which is equivalent to

(θ_{1} \to β) \land (θ_{2} \to β)

. So, two nested applications of

ψ^{Π} – Rule

can be replaced by one unnested application of the rule, and the equivalence between

[T, ψ^{Π} – Rule]

and

T + ψ^{Π} – Rule

follows.

(2) A straightforward modification of the previous proof allows us to derive part (2). We must notice that

ψ_{\exists}^{Π} – Rule

is congruent with

ψ_{\exists}^{B (Π)} – Rule

, as can be easily seen. □

Now we consider a finite set

E = {ψ_{1}, \dots, ψ_{m}}

of conditional sentences. In this case, applications of the corresponding

E^{Π} – Rule

(respectively,

E_{\exists}^{Π} – Rule

) can be described in terms of the set of rules

{ψ_{j}}^{Π} – Rule

(respectively,

{(ψ_{j})}_{\exists}^{Π} – Rule

),

j = 1, \dots, m

. We shall study the interaction of these m rules and derive a collapse result.

Proposition 3.

Let

E = {ψ_{1}, \dots, ψ_{m}}

be a finite set of conditional sentences with cardinal m. Then, for every theory T,

1.: If for every $α \to β \in E$ , $α \in \forall B (Π)$ , then $T + E^{Π} – Rule \equiv {[T, E^{Π} – Rule]}_{m}$ .
2.: If for every $α \to β \in E$ , $α \in \exists B (Π)$ , then $T + E_{\exists}^{Π} – Rule \equiv {[T, E_{\exists}^{Π} – Rule]}_{m}$ .

Proof.

(1) As we pointed out in the proof of Lemma 6, we can assume, without loss of generality, that

Π

is closed under boolean combinations. In addition, we deal with instances of

ψ^{Π} – Rule

as instances of

ψ_{\forall}^{Π} – Rule

. Now, the proof proceeds by induction on

m \geq 1

, the number of sentences in the set E.

$m = 1$ : In this case, the result is just part (1) in Lemma 6.
$m \to m + 1$ : Consider $E = {ψ_{1}, \dots, ψ_{m}, ψ_{m + 1}}$ where for each $j, 1 \leq j \leq m + 1$ , $ψ_{j}$ is a conditional sentence $α_{j} \to β_{j}$ with $α_{j} \in \forall B (Π)$ . We assume that the result holds for every set of conditional sentences with cardinal m.

In order to derive the result for E, it is enough to show that

{[T, E^{Π} – Rule]}_{m + 1}

is closed under

ψ_{j}^{Π} – Rule

for each

j = 1, \dots, m + 1

. Let us assume that

{[T, E^{Π} – Rule]}_{m + 1} ⊢ θ \to α_{p}

for some p

(1 \leq p \leq m + 1)

and

θ \in \exists Π

. Put

D = E - {ψ_{p}}

. It is enough to show that

{[T, D^{Π} – Rule]}_{m} ⊢ θ \to α_{p}

, since then it will follow that

{[T, E^{Π} – Rule]}_{m + 1} ⊢ θ \to β_{p}

, as required.

We observe that

\neg α_{p}

is equivalent to an

\exists B (Π)

sentence and, as a consequence, by Lemma 4,

{[T, D^{Π} – Rule]}_{m} + \neg α_{p} \equiv {[T + \neg α_{p}, D^{Π} – Rule]}_{m} .

Now the crucial point is the following fact:

Claim: $(T + \neg α_{p}) + D^{Π} – Rule e x t e n d s T + E^{Π} – Rule .$
Proof of Claim: We shall prove this by showing, by induction on $k \geq 1$ , that for all $k \geq 1$ , $(T + \neg α_{p}) + D^{Π} – Rule$ extends ${[T, E^{Π} – Rule]}_{k}$ :
- $•$ For $k = 1$ , this is straightforward in view of the following easy fact: if S extends T and $S ⊢ σ \to α_{p}$ for some sentence $σ \in \exists Π$ , then $S + \neg α_{p} ⊢ σ \to β_{p}$ .
- $•$ Assume that for some $k \geq 1$ the result holds, and assume that
  
  ${[T, E^{Π} – Rule]}_{k} ⊢ σ \to α_{j}$
  
  for some j, $(1 \leq j \leq m + 1)$ and $σ \in \exists Π$ . By the induction hypothesis on k,
  
  $(T + \neg α_{p}) + D^{Π} – Rule ⊢ σ \to α_{j}$
  
  and we can now distinguish two cases:
  ·
  If $j \neq p$ , then obviously $(T + \neg α_{p}) + D^{Π} – Rule ⊢ σ \to β_{j} .$
  ·
  If $j = p$ , then, as in case $k = 1$ , ${[T, E^{Π} – Rule]}_{k} + \neg α_{p} ⊢ σ \to β_{p}$ . But, by the induction hypothesis on k, $(T + \neg α_{p}) + D^{Π} – Rule$ extends ${[T, E^{Π} – Rule]}_{k}$ ; so,
  
  $(T + \neg α_{p}) + D^{Π} – Rule ⊢ σ \to β_{p}$
  
  as required.
- This proves the claim. □

Consider

A ⊧ {[T, D^{Π} – Rule]}_{m}

. If

A ⊧ α_{p}

, then, obviously,

A ⊧ θ \to α_{p}

; hence, let us assume that

A ⊧ \neg α_{p}

. Then,

A ⊧ {[T, D^{Π} – Rule]}_{m} + \neg α_{p} .

Now, by the induction hypothesis (on m),

{[T, D^{Π} – Rule]}_{m} \equiv T + D^{Π} – Rule

, and so

A ⊧ (T + \neg α_{p}) + D^{Π} – Rule .

In view of the claim above, we obtain

A ⊧ {[T, E^{Π} – Rule]}_{m + 1}

, and therefore,

A ⊧ θ \to α_{p}

. This shows that

{[T, D^{Π} – Rule]}_{m} ⊢ θ \to α_{p}

and concludes the proof.

(2): It is easy to adapt the proof of item (1). We omit the details. □

Remark 6.

Let us note that Proposition 3 also holds for every finite set of conditional sentences, E, satisfying that for all

α \to β \in E

,

α \in \forall B (Π) \cup \exists B (Π)

.

For instance, we can deal with part (2) of Proposition 3 just by putting

E_{1} = {α \to β \in E : α \in \exists B (Π)}, a n d E_{2} = {α \to β \in E : α \in \forall B (Π)}

and having in mind that for all T, if

α \to β \in E_{2}

, then we obtain

[T, {(E_{2})}_{\exists}^{Π} – Rule] ⊢ α \to β

, since

α \in \forall B (Π)

and

T ⊢ α \to α

. Therefore,

[T, {(E_{2})}_{\exists}^{Π} – Rule] \equiv T + E_{2}

, and it follows that

{[T, E_{\exists}^{Π} – Rule]}_{m} \equiv {[[T, {(E_{2})}_{\exists}^{Π} – Rule], {(E_{1})}_{\exists}^{Π} – Rule]}_{m_{1}} \equiv T + E_{\exists}^{Π} – Rule

where

m_{1} = | E_{1} |

. We can deal with part (1) in a similar way.

We are now in a position to prove the main result of the present section. From Proposition 3 and Lemma 5, we derive the following version of theorem 1.4 of [17].

Theorem 3.

Let T be a theory and let E be a finite set of conditional sentences such that for every

α \to β \in E

,

α \in \forall B (Π) \cup \exists B (Π)

. Let m be the number of elements of E. Then,

1.: $T + E$ is $\forall B (Π)$ -conservative over ${[T, E^{Π} – Rule]}_{m}$ .
2.: $T + E$ is $\exists B (Π)$ -conservative over ${[T, E_{\exists}^{Π} – Rule]}_{m}$ .

Proof.

In view of Remark 6, part (1) follows from Lemma 5 and Proposition 3.

We give a direct proof for part (2).

Let

φ \in \exists B (Π)

be a sentence such that

T + E ⊢ φ

. We shall prove that

{[T, E_{\exists}^{Π} – Rule]}_{m} ⊢ φ

by induction on

m \geq 1

:

m = 1

: If

E = {α \to β}

and

T + (α \to β) ⊢ φ

, then

T ⊢ \neg φ \to \neg (α \to β)

. As a consequence,

T ⊢ (\neg φ \to α) \land (\neg φ \to \neg β)

. Since

\neg φ \in \forall B (Π)

, we obtain

[T, E_{\exists}^{Π} – Rule] ⊢ \neg φ \to β

and, therefore,

[T, E_{\exists}^{Π} – Rule] ⊢ φ

.

m \to m + 1

: Let

E = {α_{1} \to β_{1}, \dots, α_{m + 1} \to β_{m + 1}}

be a set of sentences. First, we consider the case where

α_{j} \in \exists B (Π)

for all

j = 1, \dots, m + 1

. Let

φ \in \exists B (Π)

be a sentence such that

T + ⋀_{j = 1}^{m + 1} (α_{j} \to β_{j}) ⊢ φ

. Then,

T ⊢ \neg φ \to \neg ⋀_{j = 1}^{m + 1} (α_{j} \to β_{j})

and, in particular, it follows that

T ⊢ \neg φ \to ⋁_{j = 1}^{m + 1} \neg β_{j}

. For each

l = 1, \dots, m + 1

, let

T_{l}

be the theory

T + E_{l}

where

E_{l} = E - {α_{l} \to β_{l}}

. Then,

T_{l} + (α_{l} \to β_{l}) ⊢ φ

, and reasoning as in case

m = 1

, we obtain

T_{l} ⊢ \neg φ \to α_{l}

. If

α_{l} \in \exists B (Π)

, then by the induction hypothesis,

{[T, E_{\exists}^{Π} – Rule]}_{m} ⊢ \neg φ \to α_{l}

. We proved in this way that for all l

(1 \leq l \leq m + 1)

,

{[T, E_{\exists}^{Π} – Rule]}_{m + 1} ⊢ \neg φ \to β_{l}

. As a consequence,

{[T, E_{\exists}^{Π} – Rule]}_{m + 1} ⊢ \neg φ \to ⋀_{j = 1}^{m + 1} β_{j}

and it follows that

{[T, E_{\exists}^{Π} – Rule]}_{m + 1} ⊢ φ

.

For the general case, we put

E_{1} = {α \to β \in E : α \in \exists B (Π)} a n d E_{2} = {α \to β \in E : α \in \forall B (Π)}

and observe that

T + E_{2} \equiv [T, {(E_{2})}_{\exists}^{Π} – Rule]

. Then, by the previous restricted case,

T + E

is

\exists B (Π)

-conservative over

{[T + E_{2}, {(E_{1})}_{\exists}^{Π} – Rule]}_{m_{1}}

(where

m_{1} = | E_{1} |

). Hence,

T + E

is

\exists B (Π)

-conservative over

{[[T, {(E_{2})}_{\exists}^{Π} – Rule], {(E_{1})}_{\exists}^{Π} – Rule]}_{m_{1}}

which is, obviously, a subtheory of

{[T, E_{\exists}^{Π} – Rule]}_{m}

. □

Corollary 2.

Let E be a set of normal conditional axioms with respect to Π over T such that E is Π-reducible modulo T. If F is a finite subset of

U E

with m elements or F is a finite set of m sentences included in E, then

T h_{\forall B (Π)} (T + F) \subseteq {[T, E – Rule]}_{m} .

6. Conclusions and Future Work

Both in pure and in applied logic, the question of whether it is more convenient to formalize a certain mathematical principle as an axiom or as an inference rule is important and ubiquitous.

In this work, we developed a general logical framework that allows for replacing axioms with corresponding inference rules without greatly affecting the proof–theoretical strength, preserving theorems up to a certain level of quantifier complexity. While these results are familiar to logicians working in arithmetic formal theories, we believe they could also benefit a broader audience in logic.

The proof methods we use are conceptually very simple: we essentially combine syntactical manipulations and basic model–theoretic constructions. This should make the article accessible to a wide audience.

Several avenues for future research suggest themselves. Firstly, it is natural to ask whether the main results of the present paper also hold for other settings different from classical first-order logics (such as, for example, intuitionistic logic or minimal logic). In order to explore this line of future work, one should have to isolate first a suitable notion that could play the role of the

\exists Π

-closed models for these new settings. Secondly, and still from a theoretical point of view, it would also be of interest to explore possible applications of the obtained results to other formal theories different from the arithmetical ones (for example, theories axiomatized by geometric axioms could serve as a first field of study). Finally, from an applied perspective, it would be desirable to investigate possible applications to rule-based reasoning in computational logic. In areas such as logic programming, expert systems, or the Semantic Web, rule-based knowledge representation is a crucial and powerful tool, and implementations of rule-based systems naturally emerge. A formal analysis of the axioms-as-rules strategy could be of interest in these fields.

We close this paper with some pointers to the kind of applications that we think deserve further exploration.

6.1. Description Logics

Conditional axioms are frequently used as "general axioms" in description logics. For example, in

ALC

logic, consider the axiom

\exists r_{1} . \forall r_{2} A ⊑ \forall s_{1} . \exists s_{2} . B

which, when translated to first-order logic (with A and B being concept names translated to unary predicates and roles r,

s_{1}

, and

s_{2}

as binary relations), becomes

\forall x (\exists y_{1} (r_{1} (x, y_{1}) \land \forall y_{2} (r (y_{1}, y_{2}) \to A (y_{2}))) \to \forall z_{1} (s_{1} (x, z_{1}) \to \exists z_{2} (s_{2} (z_{1}, z_{2}) \land B (z_{2}))))

which is equivalent to the normal condition axiom (defined in Section 4):

\exists y_{1} \forall y_{2} (r_{1} (x, y_{1}) \land (r (y_{1}, y_{2}) \to A (y_{2}))) \to \forall z_{1} \exists z_{2} (s_{1} (x, z_{1}) \to (s_{2} (z_{1}, z_{2}) \land B (z_{2})))

An example of the utility of introducing rules would be as a mechanism for simplification through design patterns. Consider the following example extracted from [27]:

\begin{matrix} O_{1} = {Jaguar ⊑ Animal, & Jaguar ⊑ \forall hasChild . Jaguar, \\ Tiger ⊑ Animal, & Tiger ⊑ \forall hasChild . Tiger, \\ Lion ⊑ Animal, & Lion ⊑ \forall hasChild . Lion} \end{matrix}

Note that, since it is not possible to quantify over concepts in typical description logics, expressing the fact that every subclass of animals only has a child of the same subclass is not feasible. Nevertheless, by employing a certain type of (meta)rules [27], it is possible to obtain a representation of such information, which allows for the elimination of certain axioms from the ontology:

O_{2} = {Jaguar ⊑ Animal, Tiger ⊑ Animal, Lion ⊑ Animal}

g : \underset{Body}{\underset{︸}{{? X ⊑ Animal}}} \to \underset{Head}{\underset{︸}{{? X ⊑ \forall hasChild . ? X}}},

which could be added to the standard tableau-based consistency class algorithm as a rule that acts on classes:

R : \frac{a : ? X ⊑ Animal}{a : ? X ⊑ \forall hasChild . ? X}

The question that arises with this transformation is what degree of conservativity is maintained between the original ontology

O_{1}

and the new representation

O_{2} + R

.

In the framework of first-order logic, we could deal with g as an axiom scheme, where X ranges over a convenient class of formulas. In this way, we could obtain a set of conditional axioms E and applications of the (meta)rule R that will correspond to applications of the

E – Rule

. Nevertheless, although we do not discard that some conservation results could be derived by a more or less direct application of the machinery developed in this paper, the chief question here would be: Is it possible to derive, in the setting of description logics, some results and techniques similar to the ones developed in this paper for first-order theories? In particular, can such an approach be useful in settling the exact conservation between the ontologies

O_{1}

and

O_{2} + R

(or other similar cases)?

6.2. Coherent/Geometric Logic

In Remark 3, we briefly discussed the notion of a (finitary) geometric axiom (also known as a coherent formula). There we pointed out that a set E of geometric axioms provides us with an example of a normal set of conditional axioms with respect to the basic fragment

Π

of all atomic formulas of the language. We also noticed that if we put

D = {\neg β (\vec{v}) \to \neg α (\vec{v}) : α (\vec{v}) \to β (\vec{v}) \in E}

then

T + E \equiv T + D

, and D is also a set of normal conditional axioms with respect to

Π

. Therefore, by Proposition 2,

T + E

is

\forall B (Π)

-conservative over

T + D^{Π} – Rule

.

It is a theorem of Barr that if a geometric sentence is derivable from a geometric theory using classical (first-order) logic, then it is also derivable using intuitionistic logic. As noted in [25], this result can be easily derived using a cut–elimination argument. However, can the model–theoretic methods that we presented in this paper be adapted to derive Barr’s theorem or some related result?

Towards a positive answer to this problem, the following question could be considered. In [28], Coste, Lombardi, and Roy deal with dynamical theories axiomatized by geometric axioms (in [28], the term "dynamical axiom" is used to refer to geometric axioms). A key notion in that work is the notion of a dynamical proof (closely related to a derivation of a basic sequence in a cut-free system with mathematical rules, as pointed out in [25]). Theorem 1.1 in [28] shows that there is a construction associating a dynamical proof to each classical proof of an atomic formula B from a set of atomic formulas R. Is it possible to transform a proof of a geometric sentence in

T + D^{Π} – Rule

in (some kind of) a dynamical proof?

6.3. Inference Rules and Automated Reasoning

The examples we just mentioned are interesting not only from a theoretical point of view. The inference rules we described in these examples can be implemented as tools for automated reasoning. Automated theorem proving is an important research field with applications in Artificial Intelligence and Program Verification (just to mention two application areas). Inference rules provide the base for efficient implementations of the reasoning principles expressed by conditional axioms. In addition to providing a guide to the search for appropriate inference rules, the results presented in this paper can play a fundamental role in the analysis of the formal properties (correctness, completeness, conservativeness,…) of the systems finally developed.

A classic example of such use is provided by the Boyer–Moore Nqthm theorem prover [29] and the closely related system ACL2 [30], developed by Kaufmann and Moore, used in the modeling and verification of computer hardware and software. In these systems, the (noetherian) induction principle is implemented as an inference rule that provides a powerful tool to derive properties of functions defined by recursion. Model–theoretic methods play an important role in the proofs of the correctness properties of ACL2, as shown in [31].

Author Contributions

Conceptualization, F.F.L.-M., A.C.-F. and J.B.-D.; methodology, F.F.L.-M. and A.C.-F.; formal analysis, F.F.L.-M. and A.C.-F.; investigation, F.F.L.-M. and A.C.-F.; resources, J.B.-D.; writing—original draft preparation, F.F.L.-M., A.C.-F. and J.B.-D.; writing—review and editing, F.F.L.-M., A.C.-F. and J.B.-D.; supervision, F.F.L.-M., A.C.-F. and J.B.-D.; project administration, J.B.-D.; funding acquisition, J.B.-D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Projects MTM-PID2020-116773GB-I00 and PID2019-109152G, MCIN/AEI/10.13039/501100011033, both funded by the spanish State Investigation Agency (Agencia Estatal de Investigación).

Data Availability Statement

Data are contained within the article.

Acknowledgments

We thank the anonymous reviewers for their suggestions, which have helped to clarify the content of the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Burel, G. From Axioms to Rewriting Rules. Manuscript. Available online: https://web4.ensiie.fr/~guillaume.burel/download/burel20axioms.pdf (accessed on 20 February 2024).
Aoto, T.; Stratulat, S. Decision procedures for proving inductive theorems without induction. In Proceedings of the PPDP ’14: 16th International Symposium on Principles and Practice of Declarative Programming, Canterbury, UK, 8–10 September 2014; pp. 237–248. [Google Scholar] [CrossRef]
Stratulat, S. Mechanically certifying formula-based Noetherian induction reasoning. J. Symb. Comput. 2017, 80, 209–249. [Google Scholar] [CrossRef]
Vickers, S. Geometric logic in computer science. In Theory and Formal Methods 1993, Proceedings of the First Imperial College Department of Computing Workshop on Theory and Formal Methods, Isle of Thorns Conference Centre, Chelwood Gate, Sussex, UK, 29–31 March 1993; Burn, G., Gay, S., Ryan, M., Eds.; Springer: London, UK, 1993. [Google Scholar] [CrossRef]
Negri, S. Geometric Rules in Infinitary Logic. In Arnon Avron on Semantics and Proof Theory of Non-Classical Logics; Arieli, O., Zamansky, A., Eds.; Springer: Cham, Switzerland, 2021. [Google Scholar] [CrossRef]
Sieg, W. Fragments of Arithmetic. Ann. Pure Appl. Log. 1985, 28, 33–71. [Google Scholar] [CrossRef]
Sieg, W. Herbrand Analyses. Arch. Math. Log. 1991, 30, 409–441. [Google Scholar] [CrossRef]
Buss, S. Bounded Arithmetic; Studies in Proof Theory, Bibliopolis; Princeton University: Princeton, NJ, USA, 1986. [Google Scholar]
Buss, S.R. The witness function method and provably recursive functions of peano arithmetic. In Logic, Methodology and Philosophy of Science IX; Studies in Logic and the Foundations of Mathematics; Prawitz, D., Skyrms, B., Westerståhl, D., Eds.; Elsevier: Amsterdam, The Netherlands, 1995; Volume 134, pp. 29–68. [Google Scholar] [CrossRef]
Avigad, J. Saturated models of universal theories. Ann. Pure Appl. Log. 2002, 118, 219–234. [Google Scholar] [CrossRef]
Parsons, C. On n-quantifier induction. J. Symb. Log. 1972, 37, 466–482. [Google Scholar] [CrossRef]
Beklemishev, L. A proof-theoretic analysis of collection. Arch. Math. Log. 1998, 37, 275–296. [Google Scholar] [CrossRef]
Beklemishev, L. On the induction schema for decidable predicates. J. Symb. Log. 2003, 68, 17–34. [Google Scholar] [CrossRef]
Shepherdson, J.C. Non-standard models for fragments of number theory. In The Theory of Models, Proceedings of the 1963 International Symposium at Bekerley; Addison, J.W., Henkin, L., Tarsky, A., Eds.; North-Holland: Amsterdam, The Netherlands, 1965. [Google Scholar]
Kaye, R. Diophantine and Parameter—Free Induction. Ph.D. Thesis, University of Manchester, Manchester, UK, 1987. [Google Scholar]
Kaye, R. Parameter free induction in arithmetic. In Proceedings of the 5th Easter Conference on Model Theory, Wendisch-Rietz, Germany, 20–25 April 1987; Seminarbericht; Sektion Mathematik der Humboldt-Universität zu Berlin: Berlin, Germany, 1987; Volume 93, pp. 70–81. [Google Scholar]
Kaye, R. Axiomatizations and quantifier complexity. In Proceedings of the 6th Easter Conference on Model Theory, Wendisch Rietz, Germany, 4–9 April 1988; Seminarbericht; Sektion Mathematik der Humboldt-Universität zu Berlin: Berlin, Germany, 1988; Volume 98, pp. 65–84. [Google Scholar]
Beklemishev, L.D. Induction rules, reflection principles, and provably recursive functions. Ann. Pure Appl. Log. 1997, 85, 193–242. [Google Scholar] [CrossRef]
Kaye, R. Models of Peano Arithmetic; Number 15 in Oxford Logic Guides; Clarendon Press: Oxford, UK, 1991. [Google Scholar]
Hájek, P.; Pudlák, P. Metamathematics of First–Order Arithmetic; Perspectives in Mathematical Logic; Springer: Berlin/Heidelberg, Germany, 1993. [Google Scholar]
Wilmers, G. Bounded existetially induction. J. Symb. Log. 1985, 50, 72–90. [Google Scholar] [CrossRef]
Zambella, D. Notes on polynomial bounded arithmetic. J. Symb. Log. 1996, 61, 942–966. [Google Scholar] [CrossRef]
Adamowicz, Z.; Bigorajska, T. Existentially closed structures and Gödel’s second incompleteness theorem. J. Symb. Log. 2001, 66, 349–356. [Google Scholar] [CrossRef]
Cordón-Franco, A.; Fernández-Margarit, A.; Lara-Martín, F.F. Existentially Closed Models and Conservation Results in Bounded Arithmetic. J. Log. Comput. 2009, 19, 123–143. [Google Scholar] [CrossRef]
Negri, S. Contraction-free sequent calculi for geometric theories with an application to Barr’s theorem. Arch. Math. Log. 2003, 42, 389–401. [Google Scholar] [CrossRef]
Jěrábek, E. Induction rules in bounded arithmetic. Arch. Math. Log. 2020, 59, 461–501. [Google Scholar] [CrossRef]
Kindermann, C.; Lupp, D.P.; Sattler, U.; Thorstensen, E. Generating ontologies from templates: A rule-based approach for capturing regularity. In Proceedings of the Description Logics, CEUR Workshop Proceedings, Tempe, AZ, USA, 27–29 October 2018; Volume 2211. Available online: https://ceur-ws.org/ (accessed on 15 December 2023).
Coste, M.; Lombardi, H.; Roy, M. Dynamical method in algebra: Effective Nullstellensätz. Ann. Pure Appl. Log. 2001, 111, 203–256. [Google Scholar] [CrossRef]
Boyer, R.S.; Moore, J.S. A Computational Logic Handbook; Academic Press: London, UK, 1998. [Google Scholar]
Kaufmann, M.; Manolios, P.; Moore, J.S. Computer-Aided Reasoning: An Approach; Kluwer Academic Press: Norwell, MA, USA, 2000. [Google Scholar]
Kaufmann, M.; Moore, J. Structured Theory Development for a Mechanized Logic. J. Autom. Reason. 2001, 26, 161–203. [Google Scholar] [CrossRef]

Table 1. Theories to be considered in this paper.

Extension	T Augmented with/Closed Under
$T + E$	$\forall \vec{v} (α (\vec{v}) \to β (\vec{v}))$ , with $α (\vec{v}) \to β (\vec{v}) \in E$
$T + U E$	$\forall \vec{v} α (\vec{v}) \to \forall \vec{v} β (\vec{v})$ , with $α (\vec{v}) \to β (\vec{v}) \in E$
$T + E – Rule$	$\frac{\forall \vec{v} α (\vec{v})}{\forall \vec{v} β (\vec{v})}$ , with $α (\vec{v}) \to β (\vec{v}) \in E$
$T + E^{Π} – Rule$	$\frac{\forall \vec{v} \forall \vec{z} (θ (\vec{v}, \vec{z}) \to α (\vec{v}))}{\forall \vec{v} \forall \vec{z} (θ (\vec{v}, \vec{z}) \to β (\vec{v}))}$ , with $α (\vec{v}) \to β (\vec{v}) \in E$ and $θ \in {(Π \cup \neg Π)}^{\land}$
$T + E_{\forall}^{Π} – Rule$	$\frac{θ \to α}{θ \to β}$ , with $α \to β \in E$ and $θ \in \exists B (Π)$
$T + E_{\exists}^{Π} – Rule$	$\frac{θ \to α}{θ \to β}$ , with $α \to β \in E$ and $θ \in \forall B (Π)$
${[T, R]}_{m}$	Nested applications of the corresponding rule R with a depth of at most m

where m is a natural number and

θ \in {(Π \cup \neg Π)}^{\land}

expresses the fact that

θ (\vec{v}, \vec{z})

is a finite conjunction of formulas, each of which is in

Π

or its negation is in

Π

.

Table 2. Conservation results demonstrated in this paper.

Subtheory	Conservation	Conditions on E	Reference
$T + E^{Π} – Rule$	$\forall B (Π)$	$α (\vec{v}) \in \forall B (Π)$ , $β (\vec{v}) \in \forall \exists B (Π)$	Corollary 1
$T + E – Rule$	$\forall B (Π)$	$α (\vec{v}) \in \forall B (Π)$ , $β (\vec{v}) \in \forall \exists B (Π)$ , E is weakly $Π$ -reducible	Theorem 1
$T + U E$	$\exists \forall B (Π)$	$α (\vec{v}) \in \forall B (Π)$ , $β (\vec{v}) \in \forall \exists B (Π)$ , E is weakly $Π$ -reducible	Theorem 1
$T + E^{Π} – Rule$	$\forall B (Π)$	$α \in B (\forall B (Π))$ , $α \to β$ sentences	Lemma 5
${[T, E^{Π} – Rule]}_{m}$	$\forall B (Π)$	$α \in \forall B (Π) \cup \exists B (Π)$ , E consists of m sentences	Theorem 3
${[T, E_{\exists}^{Π} – Rule]}_{m}$	$\exists B (Π)$	$α \in \forall B (Π) \cup \exists B (Π)$ , E consists of m sentences	Theorem 3
${[T, E – Rule]}_{m}$	$\forall B (Π)$	$α \in \forall B (Π) \cup \exists B (Π)$ , E consists of m sentences, E is $Π$ -reducible	Corollary 2

where E is said to be weakly

Π

-reducible (modulo T) if

S + E – Rule \equiv S + E^{Π} – Rule

for each theory S extending T, and E is said to be

Π

-reducible (modulo T) if

{[S, E – Rule]}_{1} \equiv {[S, E^{Π} – Rule]}_{1}

for each theory S extending T.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Borrego-Díaz, J.; Cordón-Franco, A.; Lara-Martín, F.F. On Conditional Axioms and Associated Inference Rules. Axioms 2024, 13, 306. https://doi.org/10.3390/axioms13050306

AMA Style

Borrego-Díaz J, Cordón-Franco A, Lara-Martín FF. On Conditional Axioms and Associated Inference Rules. Axioms. 2024; 13(5):306. https://doi.org/10.3390/axioms13050306

Chicago/Turabian Style

Borrego-Díaz, Joaquín, Andrés Cordón-Franco, and Francisco Félix Lara-Martín. 2024. "On Conditional Axioms and Associated Inference Rules" Axioms 13, no. 5: 306. https://doi.org/10.3390/axioms13050306

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

On Conditional Axioms and Associated Inference Rules

Abstract

1. Introduction

1.1. From Conditional Axioms to Rules

1.2. Aim and Structure of the Paper

2. Inference Rules and Conditional Axioms

3. A Model–Theoretic Standpoint

4. Normal Conditional Axioms

5. Finite Sets of Conditional Sentences

6. Conclusions and Future Work

6.1. Description Logics

6.2. Coherent/Geometric Logic

6.3. Inference Rules and Automated Reasoning

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI