1. Introduction
Since their introduction in the early 2000s, CubeSats [
1] have revolutionized space mission design. Their standardized design has enabled smaller nations and organizations with limited resources to access space, making them a key research area and a new opportunity for both industry and academia [
2]. The
SMAD [
3] provides a detailed history and highlights the significant changes brought by CubeSats, which include, e.g., a more cost-effective and standardized platform, along with a standardized type of dispensers known as the P-POD [
4]; simpler and more streamlined design phases, resulting in significantly fewer team members and reduced project timelines; and reliability (i.e., the measures of how likely the system is to function without failure in operating conditions [
5]) comparable to that of larger satellite missions.
The
“Space Economy Report 2023” [
6] emphasizes a significant shift in the space industry over the next decade, including a rise in satellite numbers and projected market growth to approximately
$737B. This trend aligns with the predictions of the
"Nanosats Database" [
7]. Industry demands faster space systems design, prompting a shift from Document-Based System Engineering to Model-Based Systems Engineering (MBSE),
“the formalized application of modeling to support system requirements, design, analysis, verification, and validation activities beginning in the conceptual design phase and continuing throughout development and later life cycle phases” [
8], in parallel with the further adoption of Concurrent Engineering (CE) methods.
In contrast to conventional sequential design processes, where specialists work independently on their respective subsystem designs, CE is characterized by the simultaneous collaboration of multiple engineers. Each engineer brings their specific expertise to the table, distinguishing CE by its collaborative, integrated, and real-time approach [
9,
10]. Knoll et al. [
11] identify key technical challenges in adopting Concurrent Engineering (CE) methodologies, notably the necessity for robust integration among discipline-specific tools and the comprehensive use of MBSE.
CE and MBSE approaches are gaining traction for their efficiency, accuracy, and collaborative benefits in engineering, as per the International Council on Systems Engineering (INCOSE) in its
"Systems engineering Handbook" [
12]; this view is also supported by Henderson and Salado [
13] and Syan and Menon [
14]. As can be seen, the benefits mentioned above correspond to needs expressed by the space sector, such as enhancing how space systems are developed, and allowing real-time collaboration to expedite the design, as per Knoll et al. [
11].
INCOSE
“Systems engineering (SE) Vision 2035" [
15] foresees that the future of SE heavily relies on MBSE, with advanced models and integrated simulations that enhance system complexity management, efficiency, and reliability. By centralizing information within a cohesive model, MBSE, when integrated with simulations, promotes a shared and comprehensive understanding of complex systems. This approach aids in identifying design limitations or incompatibilities, thereby preventing time and budget overruns, particularly during the operational phase. The INCOSE
“SE Handbook" [
12] indicates that this aspect grows increasingly important as system complexity rises.
However, the implementation of MBSE is faced with a set of challenges, including data discontinuities across various design stages and a lack of reusable generic models, as highlighted by Knoll et al. [
11] and Salas Cordero et al. [
16]. These challenges may impact system models, simulation tools, and development cost reduction. An in-depth review of such discrepancies in tools used to design and analyze complex systems in different design phases is available to the reader in the work of Bajaj et al. [
17].
Some works (e.g., Bajaj et al. [
17] and Spangelo et al. [
18]) present tools that partly address these gaps as they were developed to facilitate integration between systems modeling language (SysML) models and specific simulation tools. Such tools, although offering valuable insights towards the integration of SysML models and various simulation tools, focus on proprietary software that not everyone has access to.
Scholz and Juang’s work [
19] discusses the advantages of
Open Design (In Scholz and Juang [
19], open Design is referred to as the combination of Open Source Software (OSS) and Open Source Hardware (OSHW)) for sectors such as space mission design. Specifically, for budget-constrained projects often seen in academic settings, this approach can enhance data sharing among various entities, thereby fostering the development of more reliable, cost-efficient, and innovative solutions. This is largely due to the extensive peer review that is inherent in the open source community. Furthermore, adopting the open source model in CubeSat development within academia could simplify the design process. It enables the reuse of ideas, software, and hardware, facilitating collaboration between different groups. Project members can concentrate on distinct aspects of the same project without the complications posed by confidentiality agreements [
19].
The push for an open source software ecosystem has led to the development of software frameworks such as the Nanostar software suite (NSS) v1.1 [
20,
21], which supports a unified database across principal analysis software during the preliminary design (PD) phase. However, the NSS lacks integration to modeling languages such as the unified modeling language (UML) or the systems modeling language (SysML), a gap pointed out in the studies by Salas Cordero et al. [
16]. The work [
16] highlights that using MBSE can lead to streamlining the mapping of system element connections, which can enhance change management, an aspect crucial throughout the entire life cycle of a product.
The work presented in this paper strives to pave the way toward closing the gap between the expected benefits of implementing MBSE and CE approaches and their deployment in an open source framework specifically applied towards the development of CubeSats, answering two main research questions:
How to formalize the integration of MBSE tools and modeling languages with third-party open source simulation tools for application throughout the entire PD phase?
How to formalize a CubeSat model that is general enough to be reusable across different mission designs, yet detailed enough for a comprehensive representation of the systems?
The research work presented in this paper is set within the context of the Nanostar software suite (NSS) [
16,
20], also explored during a master’s thesis work [
22]. This paper aims to formalize a generalized SysML CubeSat model that has been enhanced through the use of UML stereotypes for data characterization at any design stage and is capable of being linked with any set of open source simulation tools, benefiting researchers, engineers, and students in CubeSat development. The formalization aims to streamline the design process and improve CubeSat mission PD, while procuring the reusability of the framework throughout different CubeSat missions and issuing the need to reduce the fragmentation of data and information in SE, thus posing the CubeSat model as the central source of truth for data across all design stages. To the best of the authors’ current knowledge, there is no open source framework able to address all the stated challenges and requirements in the scope of this research work.
The proposed integration process (represented in
Figure 1) enables seamless interaction between the CubeSat model and simulation tools; and it is explained with greater detail throughout the forthcoming sections. The proposed framework possesses the following main three steps:
Data characterization within a CubeSat SysML model through UML stereotypes. This includes the proposed generalization process for the operating modes (OMs) of a spacecraft (S/C) as a way to obtain a generalized activity profile;
Generation of a model file parser for data extraction and generation of a software-readable data structure;
Implementation of the newly generated data structure with any selected set of simulation and analysis tools, e.g., the ones from the NSS constellation.
The novelty of this approach resides in its capability to include relevant design parameters within a CubeSat model (step 1); information then is automatically extracted through a dedicated parser (step 2) for simulation and analysis activities in third-party tools (step 3).
This paper is organized as follows:
Section 2 provides an overview of the principal budgets and data required for PD phase studies, which are included in the proposed CubeSat model formalization.
Section 3 details the creation of a CubeSat SysML model that is augmented with the help of UML stereotypes. This feature, thoroughly discussed later, facilitates the integration of the model with diverse simulation and analysis tools. A formalized approach for CubeSat OMs is also introduced, aimed at generating a general activity profile (which delineates the variety of tasks or operations that the S/C is expected to execute during its mission, encompassing the transitions between operation modes) for mission analysis and simulation.
Section 4 presents an end-to-end application of the proposed formalization, serving as a proof of concept of the latter, by modeling a theoretical space mission concept and the integration of such model with the NSS simulation tools.
Section 5 discusses the main results of this work and compares its main outcomes with the current literature. Finally, the paper concludes by summarizing its main findings and outlining potential directions for future research. Two Appendices are included at the end of this work:
Appendix A presents the main SysML and UML elements mentioned within this paper for reference, while
Appendix B presents the system-level block definition diagram (BDD) of the presented case study for more detail.
2. CubeSat Preliminary Design
The CubeSat standard has revolutionized the space industry by its simplicity and affordability, as highlighted in the
SMAD [
3]. This innovation has increased access to space, allowing a wider range of institutions, including smaller academic and research organizations, to participate in space exploration.
The preliminary design (PD) phase of a satellite, even for a compact CubeSat, requires addressing multidisciplinary challenges. These include balancing technical constraints like mass, volume, power, and data, each with distinct challenges and limitations. Additional complexities such as communication budgets and mission-specific requirements add to the design intricacies.
The work from Gateau et al. [
21] provides a detailed examination of these challenges, offering critical insights into the budgets and studies necessary to assess CubeSat mission feasibility, summarized in
Table 1.
Mass Budget: Crucial for spacecraft engineering, the mass budget details total mass, including structural elements, harnesses, and propellants. It also integrates margins to handle uncertainties and changes across design and operational phases.
Volume Budget: In S/C design, the volume budget specifies internal volume constraints and ensures each component fits within these limits, including operational margins.
Power Budget: Essential for mission viability, this budget calculates power needs for all S/C components and generation capabilities, considering the impact of operational orbit on eclipse durations. Power margins are managed to accommodate energy fluctuations.
Link Budget: Critical for mission communication, the link budget evaluates signal strength, transmission losses, and antenna sensitivity. It considers factors like antenna gain, travel distance, and atmospheric losses, ensuring reliable S/C communications.
Data Budget: This budget manages S/C data capabilities, estimating data generation, processing, storage, and downlink needs based on mission duration and orbit. It prioritizes data types and volumes to maximize mission value within resource limits.
Other Budgets: For complex missions, additional analyses may include propellant, radiation, and heat dissipation budgets, or focus on attitude and orbit control system (AOCS) sizing and operational specifications.
Margins: As per the European Space Agency (ESA)’s
Margin Philosophy for Science Assessment Studies [
23], margins at system and component levels accommodate design and operational variations, ensuring robust S/C functionality.
Activity Profile: Outlines the S/C operations based on the concept of operations, varying with mission objectives. This study develops a versatile activity profile for various CubeSat configurations.
3. Data Characterization within a CubeSat SysML Model
This section delves into the information required to model a CubeSat when taking into consideration its operating modes and how to characterize with a modeling language the overall data required for simulations, analyses, and trade-offs.
3.1. Operating Mode Generalization
As discussed in the preceding sections, the development of comprehensive system budgets often requires the identification of the S/C’s activity profile. This profile outlines the range of operations or activities that the S/C should perform during its mission, including transitions between operating modes. Derived from the concept of operations (ConOps), the activity profile can vary significantly depending on the specific objectives and design of the Spacecraft (S/C) and, thus, is typically tailored ad hoc for each mission.
To enhance the modeling abstraction capabilities of the proposed formalization, before delving into data characterization within a CubeSat model, it is essential to explore methods for generalizing the activity profile of a CubeSat. Consequently, this paper introduces a generalized approach to define the operating modes of a CubeSat, drawing on common attributes and criteria across various low Earth orbit (LEO) CubeSat concepts of operations.
3.1.1. Existing Approaches to OM Identification
An SE framework for CubeSat design to define operating modes (OMs) and their interdependencies can be found in Asundi and Fitz-Coy’s work [
24]. They state that the mission concept of operations (ConOps) is a crucial document for analyzing mission objectives and operations. This framework identifies mission phases, their OMs—such as a
safe mode, a
communication mode, or a
mission operations mode—and transition conditions, by defining the spacecraft’s activity profile. An example of ConOps is proposed in the National Aeronautics and Space Administration (NASA)
“Systems Engineering Handbook” [
25], along with the elements involved in its definition.
An abstracted sequence of operations (i.e., the tasks and functions that the S/C must complete during its operative life) for CubeSats can also be defined [
26], as many operations are similar or closely related enough to allow standardization of the OMs and ConOps.
As outlined in Jain’s work [
26], a standardized method of describing the operating modes, at least in terms of control sequences and minimum requirements for each OM, can be developed by analyzing the commonalities among different ConOps of LEO CubeSat missions (e.g., SwampSat [
24], Waydo et al. [
27],
and APEX [
28], VISORS [
29], Cat-2 [
30], and DICE [
31]).
Therefore, the rest of this section aims to formalize a general S/C’s activity profile. To achieve this, it is possible to identify common characteristics among the ConOps and OMs of several CubeSats in low Earth orbit (LEO) missions, through a process similar to Jain’s work [
26], as presented in the following.
3.1.2. Proposed OM Generalization
It is important to note that although comparing different missions, most of the analyzed ConOps (e.g., Waydo et al. [
27], Lightsey et al. [
29], Fish et al. [
31]) present similar OMs and transition criteria.
In particular, it can be observed how the operating modes related to early orbit phase (EOP) and off-nominal scenarios, such as detumbling, calibration, or safe modes, are almost always considered in their missions’ ConOps, with various connotations. While certainly critical from a design point of view, such OMs are not considered for the current analysis. They represent non-nominal conditions and, as a result, have properties and transition conditions that need to be studied in more detail to determine whether they can be generalized, and if so, what would be the best manner to carry this out. The operating modes relative to the operative phase of the S/Cs are the ones considered further in this paper.
Some of the OMs usually present are a servicing mode and a communication mode, which refer, respectively, to the cases during which the payload is active or a downlink/uplink with a ground station (GS) is present. A central stand-by mode is often considered to reduce power consumption when, e.g., the payload is off or if no ground stations are visible.
Some other OMs have been found throughout the literature, such as the following: Carreno-Luengo et al. [
30] discusses how some OM transitions can occur based on the state of charge of the batteries, for example, during eclipse periods when no power from the solar arrays is available. Morton and Withrow-Maser [
28], conversely, consider the attainment of a specific orbital configuration as a transition criterion between two OMs. The main discrepancies between the various OMs can thus be reduced, at the level of detail desired, to when a specific OM is activated and which subsystems are active in that mode. This can be further simplified to the differences in average power consumption and data rate.
The proposed generalization offers a structured approach to defining any set of OMs, integrating a range of parameters and sub-parameters that capture the essence of an S/C’s operational scenario.
Table 2 summarizes such generalization.
The following general transition criteria for the operating modes have been identified among the most common present in the literature:
Presence of eclipse: CubeSats often switch to a low-power mode during eclipses to conserve energy and minimize battery depth of discharge (DOD), as discussed in Asundi and Fitz-Coy [
24], Waydo et al. [
27], and Carreno-Luengo et al. [
30].
Presence of GS or specific target in line of sight (LOS) with the S/C: Communications with a Ground station (GS) to receive commands and downlink data are critical, as highlighted, i.e., by Asundi and Fitz-Coy [
24] and Fish et al. [
31].
S/C over specific Latitude/Longitude zone: For Earth observation missions, such as those noted in Carreno-Luengo et al. [
30], satellite payloads activate over targeted areas to collect mission-specific data.
S/C in specific true anomaly range: Missions may require system activation at certain orbital positions for optimal operation, as explained in Lightsey et al. [
29].
Activation at a specific time: Determined by
Kepler’s Equation, some missions plan operations based on the time since periapsis to align with mission timelines, detailed further in Fitzpatrick’s work [
32].
The analysis of various ConOps, such as in Waydo et al. [
27], shows the common use of a central
standby mode, activated when no specific operational conditions are met. To generalize, a
default mode check is essential, allowing one of the generalized OMs to become active when other specific operating mode conditions are not triggered, making this the central mode and thus ensuring continuous operation and readiness for mode transitions.
Considering each mission’s unique goals, it is crucial to allow users to set the priority for each OM activation. By using a priority counter in each operating mode definition, the models can dynamically adjust OM based on their priority levels, allowing higher priority modes to override lower ones, thus adapting to varying mission scenarios.
Key distinctions between OM influence system budgets (
Table 1) and include variations in power consumption, data transmission rates, and data quantities. Accurately defining these differences is vital for modeling the CubeSat’s operational dynamics and resource needs across different OMs.
Introducing metadata for each OM, such as mode ID, name, and post-processing colors for visualization, enhances simulation clarity and usability. These metadata facilitate better distinction and interpretation in analyses and presentations, offering clearer insights into the CubeSat’s operational profile.
By employing this general model in dedicated data structures, it becomes feasible to simulate a wide range of mission scenarios. This flexibility is invaluable in testing and refining various ConOps and operational strategies, making it a versatile tool in S/C design and mission planning. Such a comprehensive model not only aids in theoretical planning but also has practical implications, significantly enhancing the efficiency and effectiveness of mission simulations.
3.2. Data Characterization through UML Stereotypes
The proposed formalization employs unified modeling language (UML)
stereotypes to specify the attributes of blocks within systems modeling language (SysML) block definition diagrams (BDD) (see
Figure 2). For the formalization, stereotypes were implemented as extensions to the UML
Class metaclass. By extending this metaclass, any SysML block that a stereotype is applied to inherits the attributes of that stereotype, as detailed in
Appendix A. This approach not only enriches the detail and functionality of the blocks but also streamlines the modeling process by leveraging inherited characteristics.
Below, the revised set of stereotypes is introduced, and they are SystemComp; SystemMain; Battery and SolarArray; Transmitter and Receiver; Orbit and PropagationLosses; GroundStation; and last but not least, Operating Mode. These enhance the characterization of parameters within the CubeSat SysML model. Adding these stereotypes expands the model’s functionality, without compromising its ability to represent the logical and functional behaviors of the elements.
3.2.1. SystemComp
SystemComp establishes the core stereotype for a physical component of the S/C, capturing key metrics such as the system’s mass, power usage, and applicable margins, along with essential metadata like ID, name, and manufacturer for tracking components.
To further tailor the model, power usage attributes are segmented into powerConsIsStb, powerConsIsOn, and powerConsIsPeak, corresponding to stand-by, on, and peak power consumption statuses of each subsystem and component, respectively.
A table detailing the attributes linked with this stereotype is presented in
Table 3, while
Figure 2 illustrates their UML definition.
As depicted in
Figure 2, additional stereotypes further delineate
SystemComp: SystemMain, Battery, SolarArray, Transmitter, and Receiver. The applicable margins follow the ESA
“Margin philosophy for science assessment studies” [
23], which requires that they are considered at both component and system levels. As a result, they are included in the SystemComp stereotype, causing them to also be inherited by the stereotypes derived from it, including the SystemMain stereotype that is discussed below.
3.2.2. SystemMain
This stereotype can be used to define primary sub-systems of the S/C (e.g., electrical power system (EPS), propulsion system, payload, etc.). Beyond the attributes from SystemComp, it includes the list of OMs in which the system can operate for each power consumption status (stand-by, on, or peak consumption statuses have been considered for an active component), the system volume, and two Boolean flags for data extraction and analyses. Moreover, the computeMass and computeCons attributes help indicate whether simulation tools should account for the masses and power consumption of the system’s sub-components: if all components of a system are available and their respective masses (or consumption) are specified, such masses (or consumption) can be aggregated to determine the total mass (or consumption) of the system. This scenario is applicable when computeMass (or computeCons) is set to True. If, conversely, only a systemMain block has been defined (e.g., an AOCS without a detailed BDD for systemComps like sensors, actuators, etc.), but it is still needed to perform analyses, computeMass can be set to False. In this case, the software is told to use the mass specified for the block with the systemMain stereotype applied without searching for the masses of individual systemComps.
These are introduced to aid the database creation even in the early design stages when the full system architecture is still being defined.
Table 4 lists the additional attributes introduced by the SystemMain stereotype.
3.2.3. Battery and SolarArray
Battery and SolarArray stereotypes encompass the battery’s capacity and the power generated by solar arrays, in addition to the attributes introduced by SystemComp.
3.2.4. Transmitter and Receiver
These stereotypes define crucial variables for the communication system elements affecting the link budget, such as transmitter and receiver antenna gain and power, various sources of signal losses, and transmitted data rate. Their attributes are summarized in
Table 5.
3.2.5. Orbit and PropagationLosses
The
Orbit and
PropagationLosses stereotypes specify the required properties for orbital and link budget analyses, as shown in
Figure 3 and summarized in
Table 6.
3.2.6. GroundStation
The
GroundStation stereotype enables the definition of all relevant data for identifying ground stations, including position, altitude, and antenna data. This stereotype’s definition is presented in
Figure 4, while
Table 7 summarizes the introduced parameters.
3.2.7. OperatingMode
The
OperatingMode stereotype is essential for replicating the operating mode generalization presented in
Table 2. This stereotype is closely tied to the systems’ definitions, as the power consumption of each OM is calculated based on the power consumption of each subsystem or, optionally, cascaded from the individual components of each subsystem.
Three attributes have been added to the SystemMain stereotype to denote the OMs during which a particular system is in standby, on, or peak mode: modeIsStb, modeIsOn, and modeIsPeak. These enable the simulation tools to associate the power consumption of a specified system (being this either stand-by, on or peak for each mode) with the total consumption of each operating mode.
Moreover, an isDefault Boolean attribute is present in the OperatingMode stereotype, allowing the definition of an active OM (isDefault = True) when no activation criteria are satisfied.
This stereotype is shown in
Figure 5 and its attributes summarized in
Table 8.
3.2.8. Stereotype Illustrative Example: Payload Block Definition Diagram
The previous subsection presented how the necessary stereotypes were defined for the proposed formalization. This section showcases how the stereotypes can be utilized in a model using BDDs.
Figure 6 shows an early design phase of a global navigation satellite system (GNSS) payload—from a case study presented more in detail later in this work—and its BDD utilizing the aforementioned stereotypes.
It is composed of a
SystemMain block—the payload—and a set of
SystemComp blocks, connected through
Composite Associations (for more information about UML and SysML elements, including the above-mentioned Composite Associations, please refer to
Appendix A). Thus, it is possible to understand how the payload is composed of a clock monitoring and control unit (CMCU) and an atomic frequency standard (AFS), with the latter additionally composed of a passive hydrogen maser (PHM) and a rubidium atomic frequency standard (RAFS).
The computeCons and computeMass attributes from the SystemMain stereotype facilitate data extraction at any design stage, especially when detailing individual components of a system is not feasible, as happens in this case. The example above illustrates that while the masses of the components are defined, the computeMass variable is set to True, indicating that the mass of the payload is derived from the masses of its components. The same happens for the mass of the AFS block, composed of two other blocks (PHM and RAFS).
It can be noted how the masses of both payload and AFS are initialized at a null value, which is overwritten during the data extraction with the sum of the respective components.
Meanwhile, the computeCons variable is appropriately set to False, as the power consumption of various components is not yet specified. However, this does not hinder future interactions with simulation tools, as the overall payload’s power consumption is defined at the systemMain level.
As discussed in the preceding sections, the SystemMain stereotype enables also the definition of various power consumption statuses (stand-by, on, peak), to be associated with relevant OMs. In the example above, the payload is in stand-by—thus consuming 84 W—during a stand-by OM and a phasing OM, while it is on—consuming 105 W—during a servicing OM. Should there be other OMs for the represented mission, it is implied from this BDD that the payload will be off.
As per ESA
“Margin philosophy for science assessment studies” [
23], a 5% margin in this case is applied at both the component and system levels.
The proposed framework considers redundancies and inheritance of characteristics when the sub-levels of the description required are not available at a certain point in the design of the system. As can be seen in
Figure 6, the proposed formalization accounts for component redundancies (for instance, in the case of the AFS block) by noting the
multiplicity (i.e., the number near the arrowhead of each composite association, also present in the
parts section of the block) of each composite association in the BDD.
3.3. Model Parser
Most of the open-source modeling tools (e.g., Capella [
33], Papyrus [
34], Gaphor [
35]) store their models in
.xml format [
36]. This is a common file format for storing and transmitting data structures with a pre-defined set of rules.
Understanding the unique rules and structures of model files enables the creation of simple Python libraries for data extraction and manipulation of .xml files. The formalization proposed is beneficial as it leverages common attributes among SysML blocks with similar stereotypes, promoting uniformity in the model files. The connections among stereotypes, their attributes, and the associated blocks can be exploited for developing a parser that converts all the data included within the SysML model into a software-readable data structure.
Therefore, with prior knowledge of the stereotypes and attributes within a model, and once its file structure is known, extracting all the necessary data for simulation and analysis activities becomes feasible, independently from the variety and number of blocks in the model. Starting with the stereotype definitions, all the necessary properties for simulations and system budget generation are derived through cascading cross-references within the file structure.
In Pons et al. [
37], it is shown how the formats between tools are not by default interchangeable. Hence, although the formalization is tool-independent, a thorough understanding of each tool’s model file structure is crucial for the development of a dedicated model file parser. While the format of model files is consistent across different modeling tools, the method of recording information in these files varies by tool. Therefore, this paper focuses on a parser for models created with Gaphor that uses the proposed stereotypes.
The parsing process can also be reversed to enable automatic updates to model files following analysis or optimization activities. This approach confirms the role of the model as the central source of data throughout all project stages, seamlessly integrating design modifications without manual intervention.
Other relevant outputs from the parsing process include, e.g., utilizing tools like the Graphviz library [
38], enabling the automatic creation of a dependency graph (presented later in this work), which visually maps out the interrelationships among model elements and their attributes, clarifying how changes to one variable may impact the entire model.
This automated workflow increases precision and efficiency, ensuring that any adjustments or improvements from analysis or optimization are systematically and accurately reflected in the model. This not only maintains the model’s accuracy and relevance but also supports a more agile and responsive design process, allowing changes to be quickly incorporated and available for further iterations or evaluations.
Once the stereotypes necessary for characterizing the S/C and mission elements have been defined, they can be reused for describing other missions, varying only by the data entered within a model. Hence, the parser itself, being developed to work with the proposed set of stereotypes, is reusable with any model compliant with the proposed formalization.
4. Use Case Application
This section presents an application of the proposed formalization using a SysML model enhanced with UML stereotypes and linked to simulation tools from the Nanostar software suite (NSS) constellation [
20,
21]. The chosen case study is the
“The White Compass (TWC)” mission, a theoretical academic concept developed during the
“Space Mission and Systems Design” course at Politecnico di Torino. The TWC mission aims to deliver GNSS services across the Arctic region using a constellation of 12 small satellites in two high elliptical Molniya-like orbits (for more on elliptical orbits, see R. Fitzpatrick’s
“An introduction to Celestial Mechanics”, pp. 46–52 [
32]).
The core of each TWC S/C is detailed in a comprehensive SysML block definition diagram. Each BDD, such as the one illustrated in
Figure A4, applies specific UML stereotypes that are essential for defining each system and component of the S/C, accurately aligning with the mission’s objectives. Additionally, the SysML model for the TWC mission includes detailed, system-specific BDDs, orbit characterizations, ground stations, operating modes, and requirements. The full extent description of the mission and diagrams can be found in the project’s public repository. These elements provide a holistic view of the mission, covering the S/C, its operational environment, and mission-critical parameters.
The summary of the main parameters included within the model to obtain a power budget and a data budget is reported in
Table 9. The full data set is present in the work’s public repository.
After characterizing the S/C model, the next step is to extract the pertinent data from the model file. The parser primarily targets blocks with applied stereotypes. This targeted approach boosts the flexibility and broad applicability of the SysML model, ensuring that only essential data are extracted for data structure creation. This adaptability is particularly valuable, enabling the model to represent any system at any design stage, as long as the required stereotypes are implemented.
Finally, once the database is established, it is possible to engage in analysis and simulations using NSS tools (e.g., to generate the desired system budgets, as shown in
Figure 7) or other preferred simulation platforms. This phase is augmented by the capability to modify the model files directly from the Python scripts, enabling dynamic updates and adjustments to the model based on database inputs.
The entire process, once the stereotypes are defined within the model for the characterization of all relevant design parameters, is continued by means of the dedicated parser. It links the model to the chosen set of simulation tools by extracting the relevant data from the model files and generating a software-readable database.
The parser allows the parsing process to be inverted to facilitate automatic updates to model files after analysis or optimization tasks. This method establishes the model as the center reference for data across all project phases, effortlessly incorporating design changes without the need for manual intervention. In other words, the model is seen as the single source of truth.
A supplementary result of the parsing process is a requirements list (an excerpt is shown in
Figure 8). This is possible due to the nature of the
requirement element in SysML as it is a stereotype and thus manipulable by the model parser through the same logic used for the stereotypes introduced by the proposed formalization. This feature is especially valuable for modeling tools lacking the functionality to export
requirement diagrams as tables, or when the specific formatting of such tables is desired.
Another product of the parsing process is the dependency graph, illustrated in
Figure 9. For clarity, only connections related to the modeled TWC payload are shown (see
Figure 6). A complete relationship graph featuring all model elements is available in the project’s public repository.
In the SysML model, establishing
associations or other connections is essential for defining the relationships between different blocks. For instance, in the case of the
SystemComp and
SystemMain blocks shown in
Figure 9, these connections facilitate the cascading of critical properties like power consumption and mass from the component level to the main system block. Additionally, the operating modes are linked to the
SystemMain blocks through attributes such as
modeIsStb,
modeIsOn, and
modeIsPeak.
Exploiting dependencies is crucial for effectively leveraging the SysML model in the design process. By extracting data for the database, it becomes feasible to generate a relationship graph. This graph is populated by all blocks to which a stereotype has been applied. This graph provides a comprehensive visual overview of the dependencies and interactions among various elements within the model. Such a graph can offer more clear insights as to how modifications in one part of the system could affect others, by enabling more strategic and informed decision-making throughout the design phase.
When considering the entire CubeSat model, this method could allow for the inclusion of other vital systems, such as ground stations or the S/C’s communication system, revealing all critical relationships for data and link budget analysis. Moreover, adding elements like solar arrays, batteries, and an EPS, along with orbital parameters, would help visualize all interrelationships crucial to a power budget. Such an extensible relationship graph would present a holistic view of the system, underscoring the complex web of dependencies defining the S/C’s elements and the ramifications of changing one variable on the overall design.
5. Discussion
As mentioned, the preliminary design of a CubeSat involves extensive multidisciplinary efforts and complexity. Despite the increasing use of advanced SE approaches such as MBSE and CE, the fragmentation in data storage and transmission continues to be a significant challenge for the SE sector [
8,
15]. These challenges, in fact, hinder the seamless flow of information across different stages of CubeSat development, impacting efficiency and integration. Therefore, the approach proposed in this paper addresses the issues regarding a seamless flow of information. This issue was identified as well as a primary challenge in the adoption of CE methodologies in Knoll et al. [
11]. The need for a robust integration among various discipline-specific tools and the comprehensive application of MBSE is clear.
There are many inconsistencies regarding the interoperability of tools. For a detailed analysis of the inconsistencies throughout tools used for designing and analyzing complex systems at different design phases, readers can refer to Bajaj et al. [
17]. This study highlights the following two main gaps in particular:
Gap 1: The lack of model-based continuity in design and simulation activities across mission stages due to the use of different tools in initial versus later phases, leading to inconsistencies.
Gap 2: The lack of continuity between design and simulation models within each design phase, particularly between conceptual designs and mathematical models in the early stages.
Therefore, this study introduced a formalized CubeSat SysML model enhanced with UML stereotypes for data characterization, designed with flexibility and compatibility across various simulation tools in mind, demonstrating its utility and flexibility within the space system domain. The proposed framework is summarized in
Figure 10.
The diagram uses color-coded blocks to illustrate input/output relationships, following the order of the arrows. Blocks sharing the same color convey the same information and serve as either inputs (if preceding an arrow) or outputs (if following an arrow) of a specific step within the proposed formalization. More specifically,
Red: data that are required for analyses and simulation activities, to be included within the S/C model.
Green: baseline framework and model prior to the application of the proposed formalization.
Yellow: set of stereotypes to be included within the model for “red” data characterization.
Blue: newly developed model parser for data extraction and outputs (e.g., software-readable data structure, requirements lists, dependency graphs).
Purple: set of simulation and analysis tools to be linked with the model, which require the “red” data, and outputs.
The double coloring of the specific blocks describes the various information carried out by each of them. Specifically, the design parameters—in red—are represented in the model—and thus in the model file—through the proposed set of stereotypes. This information is then extracted through the parser and transferred to the newly-generated database, which is linked to the desired set of simulation and analysis tools.
The uniqueness of the presented formalization is that it offers the academic and open source community an open source framework specifically tailored to the preliminary design of CubeSats, addressing the fragmentation in data storage and allowing data transmission and simulations to be executed at any design stage.
The innovation of the proposed formalization with respect to other works in the literature is its language and tool agnosticism. Being developed with an arbitrary set of open source modeling and simulation tools, it can be tailored to the needs of those that want to apply it with their preferred solutions. As a drawback, the current state-of-the-art model parser does not allow data to be automatically extracted from models developed with tools other than Gaphor without modifying the source code.
Moreover, in contrast to what has been found in the literature, where models are mostly used as a connection point between databases and external software, the proposed model formalization serves as a central data reference throughout the preliminary design (PD) phases, integrating seamlessly with open source tools, including, in this work, the Nanostar software suite (NSS). This integration enabled the generation of power and data budgets for the TWC theoretical mission, showcasing the model’s practical applicability. The use of UML stereotypes provided a consistent structure for the model files, enhancing precision and efficiency in data extraction and manipulation. The capability for data extraction and translation in a software-readable data structure, coupled with automatic model updates after analyses and optimizations, minimized manual intervention, supporting a dynamic and responsive design process.
Given the aforementioned strengths and broad applicability with any set of open source software and modeling tools, the authors envision the formalization of the proposed framework to be used, for example, in academic and budget-constrained environments, as a support for systems engineers and the entire development team, for all stages of PD. The further application of real-time versioning and cloud storage could be also beneficial for CE practices, posing once more the proposed model as the pivotal point of the entire design process.
However, the development of the proposed CubeSat SysML model formalization and its integration with open-source tools for PD involved several limitations and design decisions, and assumptions (detailed in the next paragraphs). These aspects are essential for understanding the scope, applicability, and potential areas for improvement of the proposed formalization.
One significant limitation encountered, as anticipated, was the modeling-tool-specific nature of the parser. The parser was tailored specifically for models created with Gaphor, because of the nature of the model files to be parsed. As highlighted in Pons et al. [
37], the format in which the models are saved actually makes them not fully interoperable with other modeling tools than the ones they were created in. This restricts the immediate applicability of the parser to models created using other SysML tools. Gaphor was chosen due to its open-source nature and compatibility with the proposed UML stereotypes. However, in order to enhance usability across different platforms, future versions of the parser shall consider broader compatibility across tools.
Another limitation is the focus on low Earth orbit (LEO) missions, which formed the basis of the generalization of operating modes (OMs) and the formalized approach. This focus excludes non-nominal conditions such as detumbling, calibration, or safe modes. LEO missions were chosen due to their commonality and relevance in current CubeSat applications. Extending this framework to other mission types and non-nominal conditions remains a key area for future research. Additionally, the formalized approach did not fully incorporate OMs related to the early orbit phase (EOP) and off-nominal scenarios. The complexity and variability of these conditions were deemed beyond the scope of the presented framework, and a detailed study and potential inclusion of these scenarios could provide a more comprehensive model.
Balancing the creation of a generic framework with the need to address specific mission requirements posed challenges. A generic framework was prioritized to ensure broad applicability, with the understanding that specific mission requirements would necessitate further customization. This design decision supports reusability but may require additional adjustments for highly specialized missions.
Several key design decisions were made to address these limitations and guide the development process. The use of UML stereotypes was implemented to enrich the detail and functionality of SysML blocks, driven by the need to standardize data characterization and facilitate seamless integration with simulation tools. UML stereotypes allow for a structured approach to data characterization, ensuring consistency and enabling automated data extraction and manipulation.
The CubeSat SysML model was designed to act as a central data reference throughout all PD phases. Positioning the model as the single source of truth ensures coherence and accuracy across all design stages, supporting efficient updates and modifications. The exclusive use of open source tools, including the NSS, was another fundamental design choice. Open source tools promote transparency, accessibility, and collaboration, aligning with the academic and budget-constrained contexts in which CubeSats are often developed.
The generalization of OMs aimed to standardize the activity profiles for CubeSats by identifying common attributes and criteria across various missions. This design decision supports the creation of versatile and reusable models that can be adapted to different mission scenarios.
The future work may focus on expanding the tool compatibility by developing parsers compatible with multiple SysML tools to enhance the framework’s versatility. Extending the OM generalization to include non-nominal conditions and other mission types beyond LEO is also crucial. Enhancing the model to accommodate more complex sub-systems such as AOCS and propulsion, enabling comprehensive analyses like Delta-V and propellant budgets, should provide significant advancements in the preliminary design.
6. Conclusions and Future Perspectives
This paper introduced a formalized general CubeSat SysML model, enhanced with UML stereotypes for data characterization. This model has been proposed to serve as the central data reference throughout all PD phase stages, acting as the primary design interface.
The innovation of this work stands in the flexibility of its application within the space systems domain. A CubeSat SysML model can be created with any tool, with the condition that this tool can define any diagram or element as long as the proposed set of UML stereotypes are applied, without affecting integration with simulation tools. Moreover, once the proposed set of stereotypes are integrated within the model, it becomes possible to reuse them for different designs by only varying the introduced data.
Another result is the generalization of operating modes for CubeSats, identifying common attributes and transition criteria among various LEO CubeSats concepts of operations. This allows for different mission scenarios to be defined by just modifying OM attributes, implemented in any data format.
The use of UML stereotypes identifies common structures within the model files, enabling the extraction of key data for integration with any preferred set of simulation tools. Dependency graphs generated from this data extraction process visually depict the impact of design changes, aiding trade-off studies during the PD phase.
Additionally, creating a specialized parser for the model files allows for automatic updates to the model itself, such as after a simulation or parameter optimization phases.
A proof of concept is presented with the SysML model of the TWC mission, integrated with the NSS mission analysis tools via a dedicated parser, successfully generating power and data budgets.
Future works, mentioned as well in the previous discussion section in more detail, could address approximations by the NSS tools and limitations of the OM generalization for low Earth orbit (LEO) and early orbit phase (EOP). Moreover, a more thorough characterization of complex systems, i.e., AOCS or propulsion systems, could allow the generation of budgets such as Delta-V budgets and propellant budgets, as well as the representation of all S/C element dependencies through a comprehensive dependency graph. An interface to allow user(s) to choose what outputs to generate from the parser could be explored. Expanding these aspects would allow for modeling a broader range of missions and integrating more complex scenarios, potentially also incorporating other database formats like SQL.