EasyHybrid.jl
Documentation for EasyHybrid.jl.
EasyHybrid.EasyHybrid Module
EasyHybrid.jlEasyHybrid is a Julia package for hybrid machine learning models, combining neural networks and traditional statistical methods. It provides tools for data preprocessing, model training, and evaluation, making it easier to build and deploy hybrid models.
The hybrid model combines a neural network h(x; θ), with inputs x and learnable parameters θ, together with a mechanistic model M(·, z; ϕ) driven by forcings z and parameterized by ϕ, where ϕ may be known, learned from data, or fixed.
- Documentation https://earthyscience.github.io/EasyHybrid.jl/
EasyHybrid.DataConfig Type
Configuration for data preparation and loading.
Controls array types, observation shuffling, data splitting, cross-validation, and sequence construction for time-series training.
sourceEasyHybrid.HybridParams Type
HybridParams{M<:Function}A little parametric stub for “the params of function M.” All of your function‐based models become HybridParams{typeof(f)}.
EasyHybrid.InputBatchNorm Type
InputBatchNorm(chs; kwargs...)A wrapper around BatchNorm that handles 3D sequence input (features, timesteps, batch).
Lux's BatchNorm expects channels in the penultimate dimension, which works for 2D input (features, batch) but fails for 3D input (features, timesteps, batch) where features are in dim 1. This wrapper reshapes 3D input to 2D (features, timesteps * batch) before normalization, then reshapes back. For 2D input, delegates directly to BatchNorm.
EasyHybrid.LinearHM Type
LinearHM(NN, predictors, forcing, targets, β)A linear hybrid model with a neural network NN, predictors, forcing and targets terms.
EasyHybrid.LinearHM Method
LinearHM(NN, predictors, forcing, β)(ds_k)Model definition ŷ = α x + β
Apply the linear hybrid model to the input data ds_k (a KeyedArray with proper dimensions). The model uses the neural network NN to compute new α's based on the predictors and then computes ŷ using the forcing term x.
Returns:
A tuple containing the predicted values ŷ and a named tuple with the computed values of α and the state st.
Example:
using Lux
using EasyHybrid
using Random
using AxisKeys
ds_k = KeyedArray(rand(Float32, 3,4); data=[:a, :b, :c], sample=1:4)
m = Lux.Chain(Dense(2, 5), Dense(5, 1))
# Instantiate the model
# Note: The model is initialized with a neural network and the predictors and forcing terms
lh_model = LinearHM(m, (:a, :b), (:c,), 1.5f0)
rng = Random.default_rng()
Random.seed!(rng, 0)
ps, st = LuxCore.setup(rng, lh_model)
# Apply the model to the data
ŷ, αst = LuxCore.apply(lh_model, ds_k, ps, st)EasyHybrid.LoggingLoss Type
LoggingLossA structure to define a logging loss function for hybrid models.
Arguments
loss_types: A vector of loss specifications (Symbol, Function or Tuple)Symbol: predefined loss, e.g.
:mseFunction: custom loss function, e.g.
custom_lossTuple: function with args/kwargs:
(f, args): positional args, e.g.(weighted_loss, (0.5,))(f, kwargs): keyword args, e.g.(scaled_loss, (scale=2.0,))(f, args, kwargs): both, e.g.(complex_loss, (0.5,), (scale=2.0,))
training_loss: The loss specification to use during training (same format as above)extra_loss: Optional function(ŷ; kwargs...) -> scalarto add to training loss (default:nothing)agg: Function to aggregate losses across targets, e.g.sumormeantrain_mode: If true, usestraining_loss; otherwise usesloss_types.
Examples
# Simple predefined loss
logging = LoggingLoss(
loss_types=[:mse, :mae],
training_loss=:mse
)
# Custom loss function
custom_loss(ŷ, y) = mean(abs2, ŷ .- y)
logging = LoggingLoss(
loss_types=[:mse, custom_loss],
training_loss=custom_loss
)
# With arguments/kwargs
weighted_loss(ŷ, y, w) = w * mean(abs2, ŷ .- y)
scaled_loss(ŷ, y; scale=1.0) = scale * mean(abs2, ŷ .- y)
logging = LoggingLoss(
loss_types=[:mse, (weighted_loss, (0.5,)), (scaled_loss, (scale=2.0,))],
training_loss=(weighted_loss, (0.5,))
)EasyHybrid.PerTarget Type
PerTarget(losses)A wrapper to indicate that a tuple of losses should be applied on a per-target basis.
sourceEasyHybrid.RbQ10_2p Method
RbQ10_2p(NN, predictors, forcing, targets, Q10)(ds_k)Model definition ŷ = Rb(αᵢ(t)) * Q10^((T(t) - T_ref)/10)
ŷ (respiration rate) is computed as a function of the neural network output Rb(αᵢ(t)) and the temperature T(t) adjusted by the reference temperature T_ref (default 15°C) using the Q10 temperature sensitivity factor. ````
EasyHybrid.RecurrenceOutputDense Type
RecurrenceOutputDense(in_dims => out_dims, [activation])A layer that wraps a Dense layer to handle sequence outputs from Recurrence layers.
When a Recurrence layer has return_sequence=true, it outputs a tuple/vector of arrays (one per timestep). This layer broadcasts the Dense operation over each timestep and reshapes the result to (features, timesteps, batch) format.
Arguments
in_dims::Int: Input dimension (should match Recurrence output dimension)out_dims::Int: Output dimensionactivation: Activation function (default:identity)
Example
# Instead of manually creating:
broadcast_layer = @compact(; layer = Dense(15 => 15)) do x
y = map(layer, x)
@return permutedims(stack(y; dims = 3), (1, 3, 2))
end
# Simply use:
Chain(
Recurrence(LSTMCell(15 => 15), return_sequence = true),
RecurrenceOutputDense(15 => 15)
)EasyHybrid.RespirationRbQ10 Type
RespirationRbQ10(NN, predictors, forcing, targets, Q10)A linear hybrid model with a neural network NN, predictors, targets and forcing terms.
EasyHybrid.RespirationRbQ10 Method
RespirationRbQ10(NN, predictors, forcing, targets, Q10)(ds_k)Model definition ŷ = Rb(αᵢ(t)) * Q10^((T(t) - T_ref)/10)
ŷ (respiration rate) is computed as a function of the neural network output Rb(αᵢ(t)) and the temperature T(t) adjusted by the reference temperature T_ref (default 15°C) using the Q10 temperature sensitivity factor. ````
EasyHybrid.Rs_components Type
Rs_components(NN, predictors, forcing, targets, Q10_het, Q10_root, Q10_myc)A linear hybrid model with a neural network NN, predictors, targets and forcing terms.
EasyHybrid.Rs_components Method
Rs_components(NN, predictors, forcing, targets, Q10)(ds_k)Model definition ŷ = Rb(αᵢ(t)) * Q10^((T(t) - T_ref)/10)
ŷ (respiration rate) is computed as a function of the neural network output Rb(αᵢ(t)) and the temperature T(t) adjusted by the reference temperature T_ref (default 15°C) using the Q10 temperature sensitivity factor. ````
EasyHybrid.TrainConfig Type
Configuration for training a hybrid model.
Controls all aspects of the training process including optimization, loss computation, data handling, output, and visualization.
sourceEasyHybrid.TrainResults Type
Output of train, containing the full training history, model state, and diagnostics.
EasyHybrid.WrappedTuples Type
WrappedTuples(vec::Vector{<:NamedTuple})Wraps a vector of named tuples to allow dot-access to each field as a vector.
sourceEasyHybrid._apply_loss Function
_apply_loss(ŷ, y, y_nan, loss_spec)Helper function to apply the appropriate loss function based on the specification type.
Arguments
ŷ: Predictions for a single targety: Target values for a single targety_nan: NaN mask for a single targetloss_spec: Loss specification (Symbol, Function, or Tuple)
Returns
- Computed loss value
EasyHybrid._compute_loss Function
_compute_loss(ŷ, y, y_nan, targets, loss_spec, agg::Function)
_compute_loss(ŷ, y, y_nan, targets, loss_types::Vector, agg::Function)Compute the loss for the given predictions and targets using the specified training loss (or vector of losses) type and aggregation function.
Arguments:
ŷ: Predicted values.y: Target values.y_nan: Mask for NaN values.targets: The targets for which the loss is computed.loss_spec: The loss type to use during training, e.g.,:mse.loss_types::Vector: A vector of loss types to compute, e.g.,[:mse, :mae].agg::Function: The aggregation function to apply to the computed losses, e.g.,sumormean.
Returns a single loss value if loss_spec is provided, or a NamedTuple of losses for each type in loss_types.
EasyHybrid._get_target_nan Function
_get_target_nan(y_nan, target)Helper function to extract target-specific values from y_nan.
EasyHybrid._get_target_y Function
_get_target_y(y, target)Helper function to extract target-specific values from y, handling cases where y can be a tuple of (y_obs, y_sigma).
EasyHybrid.build_parameter_matrix Method
build_parameter_matrix(parameter_defaults_and_bounds::NamedTuple)Build a ComponentArray matrix from a NamedTuple containing parameter defaults and bounds.
This function converts a NamedTuple where each value is a tuple of (default, lower, upper) bounds into a ComponentArray with named axes for easy parameter management in hybrid models.
Arguments
parameter_defaults_and_bounds::NamedTuple: A NamedTuple where each key is a parameter name and each value is a tuple of (default, lower, upper) for that parameter.
Returns
ComponentArray: A 2D ComponentArray with:Row axis: Parameter names (from the NamedTuple keys)
Column axis: Bound types (:default, :lower, :upper)
Data: The parameter values organized in a matrix format
Example
# Define parameter defaults and bounds
parameter_defaults_and_bounds = (
θ_s = (0.464f0, 0.302f0, 0.700f0), # Saturated water content [cm³/cm³]
h_r = (1500.0f0, 1500.0f0, 1500.0f0), # Pressure head at residual water content [cm]
α = (log(0.103f0), log(0.01f0), log(7.874f0)), # Shape parameter [cm⁻¹]
n = (log(3.163f0 - 1), log(1.100f0 - 1), log(20.000f0 - 1)), # Shape parameter [-]
)
# Build the ComponentArray
parameter_matrix = build_parameter_matrix(parameter_defaults_and_bounds)
# Access specific parameter bounds
parameter_matrix.θ_s.default # Get default value for θ_s
parameter_matrix[:, :lower] # Get all lower bounds
parameter_matrix[:, :upper] # Get all upper boundsNotes
The function expects each value in the NamedTuple to be a tuple with exactly 3 elements
The order of bounds is always (default, lower, upper)
The resulting ComponentArray can be used for parameter optimization and constraint handling
EasyHybrid.build_parameters Method
build_parameters(parameters::NamedTuple, f::DataType) -> AbstractHybridModelConstructs a parameter container from a named tuple of parameter bounds and wraps it in a user-defined subtype of AbstractHybridModel.
Arguments
parameters::NamedTuple: Named tuple where each entry is a tuple of (default, lower, upper) bounds for a parameter.f::DataType: A constructor for a subtype ofAbstractHybridModelthat takes aParameterContaineras its argument.
Returns
- An instance of the user-defined
AbstractHybridModelsubtype containing the parameter container.
EasyHybrid.compute_loss Method
compute_loss(HM, x, (y_t, y_nan), ps, st, logging::LoggingLoss)Main loss function for hybrid models that handles both training and evaluation modes.
Arguments
HM: The hybrid model (AbstractLuxContainerLayer or specific model type)x: Input data for the model(y_t, y_nan): Tuple containing target values and NaN mask functions/arraysps: Model parametersst: Model statelogging: LoggingLoss configuration
Returns
In training mode (
logging.train_mode = true):(loss_value, st): Single loss value and updated state
In evaluation mode (
logging.train_mode = false):(loss_values, st, ŷ): NamedTuple of losses, state and predictions
EasyHybrid.constructNNModel Method
constructNNModel(predictors, targets; hidden_layers, activation, scale_nn_outputs)Main constructor: hidden_layers can be either • a Vector{Int} of sizes, or • a Chain of hidden-layer Dense blocks.
EasyHybrid.filter_sequences Method
filter_sequences(x, y) -> (x_filtered, y_filtered)Drop 3rd-dim samples where any predictor is NaN or all targets are NaN.
sourceEasyHybrid.get_loss_value Method
get_loss_value(losses, loss_spec, agg)Extract loss value from losses based on the loss specification type.
Arguments
losses: NamedTuple containing loss valuesloss_spec: Loss specification (Symbol, Function or Tuple)agg: Aggregation function name as Symbol
Returns
- Loss value for the specified loss function
Examples
# Symbol case
val = get_loss_value(losses, :mse, :sum)
# Function case
custom_loss(ŷ, y) = mean(abs2, ŷ .- y)
val = get_loss_value(losses, custom_loss, :mean)
# Tuple case
weighted_loss(ŷ, y, w) = w * mean(abs2, ŷ .- y)
val = get_loss_value(losses, (weighted_loss, (0.5,)), :sum)EasyHybrid.get_prediction_target_names Method
get_prediction_target_names(hm)Utility function to extract predictor/forcing and target names from a hybrid model.
Arguments:
hm: The Hybrid Model
Returns a tuple of (predictors_forcing, targets) names.
sourceEasyHybrid.initialize_plotting_observables Method
initialize_plotting_observables(init_ŷ_train, init_ŷ_val, y_train, y_val, l_init_train, l_init_val, training_loss, agg, monitor_names, target_names)Initialize plotting observables for training visualization if the Makie extension is loaded.
sourceEasyHybrid.load_timeseries_netcdf Method
load_timeseries_netcdf(path::AbstractString; timedim::AbstractString = "time") -> DataFrameReads a NetCDF file where all data variables are 1D over the specified timedim and returns a tidy DataFrame with one row per time step.
Only includes variables whose sole dimension is
timedim.Does not attempt to parse or convert time units; all columns are read as-is.
EasyHybrid.loss_fn Function
loss_fn(ŷ, y, y_nan, loss_type)Compute the loss for given predictions and targets using various loss specifications.
Arguments
ŷ: Predicted valuesy: Target valuesy_nan: Mask for NaN valuesloss_type: One of the following:Val(:rmse): Root Mean Square ErrorVal(:mse): Mean Square ErrorVal(:mae): Mean Absolute ErrorVal(:pearson): Pearson correlation coefficientVal(:r2): R-squaredVal(:pearsonLoss): 1 - Pearson correlation coefficientVal(:nseLoss): 1 - NSE::Function: Custom loss function with signaturef(ŷ, y)::Tuple{Function, Tuple}: Custom loss with argsf(ŷ, y, args...)::Tuple{Function, NamedTuple}: Custom loss with kwargsf(ŷ, y; kwargs...)::Tuple{Function, Tuple, NamedTuple}: Custom loss with bothf(ŷ, y, args...; kwargs...)
Examples
# Predefined loss
loss = loss_fn(ŷ, y, y_nan, Val(:mse))
# Custom loss function
custom_loss(ŷ, y) = mean(abs2, ŷ .- y)
loss = loss_fn(ŷ, y, y_nan, custom_loss)
# With positional arguments
weighted_loss(ŷ, y, w) = w * mean(abs2, ŷ .- y)
loss = loss_fn(ŷ, y, y_nan, (weighted_loss, (0.5,)))
# With keyword arguments
scaled_loss(ŷ, y; scale=1.0) = scale * mean(abs2, ŷ .- y)
loss = loss_fn(ŷ, y, y_nan, (scaled_loss, (scale=2.0,)))
# With both args and kwargs
complex_loss(ŷ, y, w; scale=1.0) = scale * w * mean(abs2, ŷ .- y)
loss = loss_fn(ŷ, y, y_nan, (complex_loss, (0.5,), (scale=2.0,)))You can define additional predefined loss functions by adding more methods:
import EasyHybrid: loss_fn
function EasyHybrid.loss_fn(ŷ, y, y_nan, ::Val{:nse})
return 1 - sum((ŷ[y_nan] .- y[y_nan]).^2) / sum((y[y_nan] .- mean(y[y_nan])).^2)
endEasyHybrid.make_folds Method
make_folds(df::DataFrame; k::Int=5, shuffle=true) -> Vector{Int}Assigns each observation in the DataFrame df to one of k folds for cross-validation.
Arguments
df::DataFrame: The input DataFrame whose rows are to be split into folds.k::Int=5: Number of folds to create.shuffle=true: Whether to shuffle the data before assigning folds.
Returns
folds::Vector{Int}: A vector of lengthnrow(df)where each entry is an integer in1:kindicating the fold assignment for that observation.
EasyHybrid.prepare_data Function
prepare_data(hm, data::DataFrame; array_type=:KeyedArray, drop_missing_rows=true)
prepare_data(hm, data::KeyedArray)
prepare_data(hm, data::AbstractDimArray)
prepare_data(hm, data::Tuple)Prepare data for training by extracting predictor/forcing and target variables based on the hybrid model's configuration.
Arguments:
hm: The Hybrid Modeldata: The input data, which can be a DataFrame, KeyedArray, or DimensionalData array.array_type: (DataFrame only) Output array type::KeyedArray(default) or:DimArray.drop_missing_rows: (DataFrame only) Iftrue(default), drop rows where any predictor is NaN or all targets are NaN.
Returns:
If
datais a DataFrame: a tuple of (predictors_forcing, targets) as KeyedArrays or DimArrays depending onarray_type.If
datais a KeyedArray: a tuple of (predictors_forcing, targets) as KeyedArrays.If
datais an AbstractDimArray: a tuple of (predictors_forcing, targets) as DimArrays.If
datais already a Tuple, it is returned as-is.
Method
prepare_hidden_chain(hidden_layers, in_dim, out_dim; activation, input_batchnorm=false)Construct a neural network Chain for use in NN models.
Arguments
hidden_layers::Union{Vector{Int}, Chain}:If a
Vector{Int}, specifies the sizes of each hidden layer. For example,[32, 16]creates two hidden layers with 32 and 16 units, respectively.If a
Chain, the user provides a pre-built chain of hidden layers (excluding input/output layers). If the chain ends with aRecurrencelayer, aRecurrenceOutputDenselayer is automatically added to handle the sequence output format.
in_dim::Int: Number of input features (input dimension).out_dim::Int: Number of output features (output dimension).activation: Activation function to use in hidden layers (default:tanh).input_batchnorm::Bool: Iftrue, applies aBatchNormlayer to the input (default:false).
Returns
- A
Chainobject representing the full neural network, with the following structure:Optional input batch normalization (if
input_batchnorm=true)Input layer:
Dense(in_dim, h₁, activation)whereh₁is the first hidden sizeHidden layers: either user-supplied
Chainor constructed fromhidden_layersIf last hidden layer is a
Recurrence, aRecurrenceOutputDenseis added to handle sequence outputOutput layer:
Dense(hₖ, out_dim)wherehₖis the last hidden size
where h₁ is the first hidden size and hₖ the last.
Example with Recurrence (LSTM)
# User only needs to define:
NN_Memory = Chain(
Recurrence(LSTMCell(15 => 15), return_sequence = true),
)
# The function automatically adds the RecurrenceOutputDense layer to handle sequence output
model = constructHybridModel(..., hidden_layers = NN_Memory, ...)EasyHybrid.scale_single_param Method
scale_single_param(name, raw_val, parameters)Scale a single parameter using the sigmoid scaling function.
sourceEasyHybrid.scale_single_param_minmax Method
scale_single_param_minmax(name, hm::AbstractHybridModel)Scale a single parameter using the minmax scaling function.
sourceEasyHybrid.split_data Function
split_data(data, hybridModel; split_by_id=nothing, shuffleobs=false, split_data_at=0.8, kwargs...)
split_data(data::Union{DataFrame, KeyedArray}, hybridModel; split_by_id=nothing, shuffleobs=false, split_data_at=0.8, folds=nothing, val_fold=nothing, kwargs...)
split_data(data::AbstractDimArray, hybridModel; split_by_id=nothing, shuffleobs=false, split_data_at=0.8, kwargs...)
split_data(data::Tuple, hybridModel; split_by_id=nothing, shuffleobs=false, split_data_at=0.8, kwargs...)
split_data(data::Tuple{Tuple, Tuple}, hybridModel; kwargs...)Split data into training and validation sets, either randomly, by grouping by ID, or using external fold assignments.
Arguments:
data: The data to split, which can be a DataFrame, KeyedArray, AbstractDimArray, or TuplehybridModel: The hybrid model object used for data preparationsplit_by_id=nothing: Eithernothingfor random splitting, aSymbolfor column-based splitting, or anAbstractVectorfor custom ID-based splittingshuffleobs=false: Whether to shuffle observations during splittingsplit_data_at=0.8: Ratio of data to use for trainingfolds: Vector or column name of fold assignments (1..k), one per sample/column for k-fold cross-validationval_fold: The validation fold to use whenfoldsis providedsequence_kwargs=nothing: NamedTuple of keyword arguments forwarded tosplit_into_sequences(e.g.(; input_window=10, output_window=1, output_shift=1, lead_time=2)). When set, data is windowed into 3D sequences before splitting.
Behavior:
For DataFrame/KeyedArray: Supports random splitting, ID-based splitting, and external fold assignments
For AbstractDimArray/Tuple: Random splitting only after data preparation
For pre-split Tuple{Tuple, Tuple}: Returns input unchanged
Returns:
((x_train, y_train), (x_val, y_val)): Tuple containing training and validation data pairs
EasyHybrid.split_data Method
split_data(df::DataFrame, target, xvars, seqID; f=0.8, batchsize=32, shuffle=true, partial=true)
sourceEasyHybrid.split_data Method
split_data(df::DataFrame, target, xvars; f=0.8, batchsize=32, shuffle=true, partial=true)
sourceEasyHybrid.split_into_sequences Method
split_into_sequences(x, y; input_window=5, output_window=1, output_shift=1, lead_time=1)Slide a (input_window + lead_time) window over 2D (feature, time) arrays to produce 3D (feature, time, batch) tensors for sequence-to-sequence training.
Arguments:
x: 2D input array(feature, time).y: 2D target array(target, time).input_window: number of input time steps per sample.output_window: number of target time steps per sample.output_shift: stride between consecutive samples.lead_time: gap between end of input window and end of output window.
Returns:
(X, Y)as KeyedArrays or DimArrays matching the input type.
EasyHybrid.toDataFrame Method
toDataFrame(arr, target_names)Extract specific target variables from a labeled array into a DataFrame with _pred suffix.
Arguments
arr: A labeled array or NamedTuple-like object with property accesstarget_names: Vector of target variable names to extract
Returns
DataFramewith columns named<target>_predfor each target
EasyHybrid.toDataFrame Method
toDataFrame(arr::Union{KeyedArray{T, 2}, AbstractDimArray{T, 2}}, cols_dim=:variable, index_dim=:batch_size; index_col=:index)Convert a 2D labeled array (KeyedArray or DimArray) to a DataFrame.
Arguments
arr: The 2D labeled array to convertcols_dim: Dimension name to use as DataFrame columns (default::variable)index_dim: Dimension name to use as DataFrame row index (default::batch_size)index_col: Name for the index column in the result (default::index)
Returns
DataFramewith columns fromcols_dimkeys and an index column fromindex_dimkeys
EasyHybrid.toDataFrame Method
toDataFrame(arr::AbstractLabeledArray{T, 3}, cols_dim=:variable, index_dim=:batch_size; slice_dim=:time, index_col=:index)Convert a 3D labeled array (KeyedArray or DimArray) to a Dict of DataFrames, one per slice.
Arguments
arr: The 3D labeled array to convertcols_dim: Dimension name to use as DataFrame columns (default::variable)index_dim: Dimension name to use as DataFrame row index (default::batch_size)slice_dim: Dimension name to slice along (default::time)index_col: Name for the index column in each result DataFrame (default::index)
Returns
Dict{Any, DataFrame}mapping slice keys to DataFrames
EasyHybrid.toNamedTuple Method
toNamedTuple(ka::KeyedArray, variable::Symbol) Extract a single variable from a KeyedArray and return it as a vector.
Arguments:
ka: The KeyedArray or DimArray to unpackvariable: Symbol representing the variable to extract
Returns:
- Vector containing the variable data
Example:
# Extract just SW_IN from an array
sw_in = toNamedTuple(ds, :SW_IN)EasyHybrid.toNamedTuple Method
toNamedTuple(ka::Union{KeyedArray, AbstractDimArray}, variables::Vector{Symbol})Extract specified variables from a KeyedArray or DimArray and return them as a NamedTuple of vectors.
Arguments:
ka: The KeyedArray or DimArray to unpackvariables: Vector of symbols representing the variables to extract
Returns:
- NamedTuple with variable names as keys and vectors as values
Example:
# Extract SW_IN and TA from an array
data = toNamedTuple(ds, [:SW_IN, :TA])
sw_in = data.SW_IN
ta = data.TAEasyHybrid.toNamedTuple Method
toNamedTuple(ka::KeyedArray) Extract all variables from a KeyedArray and return them as a NamedTuple of vectors.
Arguments:
ka: The KeyedArray to unpack
Returns:
- NamedTuple with all variable names as keys and vectors as values
Example:
# Extract all variables from an array
data = toNamedTuple(ds)
# Access individual variables
sw_in = data.SW_IN
ta = data.TA
nee = data.NEEEasyHybrid.to_keyedArray Function
tokeyedArray(dfg::Union{Vector,GroupedDataFrame{DataFrame}}, vars=All())
sourceEasyHybrid.train Method
train(model, data; train_cfg::TrainConfig = TrainConfig(), data_cfg::DataConfig = DataConfig())Train a hybrid model using the provided data.
Returns nothing if data preparation fails (zero-size dimension in training or validation data).
Arguments
model: The hybrid model to train.data: Training data, a singleDimArray, a singleDataFrame, a singleKeyedArray, or a tuple of those.
Keyword Arguments
train_cfg: Training configuration. SeeTrainConfigfor all options.data_cfg: Data preparation configuration. SeeDataConfigfor all options.
Returns
A TrainResults with the following fields:
train_losses: Per-epoch training losses.val_losses: Per-epoch validation losses.snapshots: Model parameter snapshots taken during training.train_obs_pred: Observed vs. predicted values on the training set.val_obs_pred: Observed vs. predicted values on the validation set.train_diffs: Additional diagnostic variables computed on the training set.val_diffs: Additional diagnostic variables computed on the validation set.ps: Final (or best) model parameters.st: Final (or best) model state.best_epoch: Epoch at which the best validation loss was achieved.best_loss: Best validation loss recorded during training.
Example
cfg = TrainConfig(nepochs=100, batchsize=32)
result = train(myModel, myData; train_cfg=cfg)EasyHybrid.@hybrid Macro
@hybrid ModelName α β γMacro to define hybrid model structs with arbitrary numbers of physical parameters.
This defines a struct with:
Default fields:
NN(neural network),predictors,forcing,targets.Additional physical parameters, i.e.,
α β γ.
Examples
@hybrid MyModel α β γ
@hybrid FluidModel (:viscosity, :density)
@hybrid SimpleModel :a :b