
Ac/gpu support #256

Open
Qfl3x wants to merge 4 commits into main from ac/gpu_support

Conversation


@Qfl3x commented Apr 2, 2026

No description provided.

lazarusA and others added 3 commits March 20, 2026 12:32
…at needs to be done in the outer loop, refactoring genericHybrid is needed for that

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces GPU support and refactors data handling to separate predictors and forcings throughout the training pipeline. Key changes include adding CUDA dependencies, updating configuration objects with device selectors, and modifying data loaders, splitters, and model forward passes to accommodate a new nested tuple input structure. Feedback highlights a mathematical error in the R-squared calculation, potential shape mismatches and incorrect NaN masking in the epoch loop, and several instances of dead code or typos. Additionally, a logic error was identified in a warning check within the data preparation module.

    function loss_fn(ŷ, y, y_nan, ::Val{:r2})
        r = cor(ŷ[y_nan], y[y_nan])
        return r * r
        return 1 - sum((y[y_nan] .- ŷ[y_nan]).^2) / sum((y[y_nan] .- mean(ŷ[y_nan])).^2)

high

The R-squared calculation is incorrect. The denominator should use the mean of the observed values (y), not the predicted values (ŷ). The standard definition of R² is $1 - SS_{res}/SS_{tot}$, where $SS_{tot}$ is calculated relative to the mean of the observations.

    return 1 - sum((y[y_nan] .- ŷ[y_nan]).^2) / sum((y[y_nan] .- mean(y[y_nan])).^2)
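A standalone sketch makes the difference concrete (the function name `r2_score` and the data are illustrative, not the package's `loss_fn`): for a prediction with a constant bias, the two denominators give different scores, and only the y-mean form is the standard R².

```julia
using Statistics

# Hypothetical sketch contrasting the denominators: the total sum of squares
# must be taken around mean(y), the observations, not mean(ŷ).
function r2_score(ŷ, y, mask)
    yv, ŷv = y[mask], ŷ[mask]
    return 1 - sum(abs2, yv .- ŷv) / sum(abs2, yv .- mean(yv))
end

y = [1.0, 2.0, 3.0, 4.0]
ŷ = y .+ 0.5                 # constant-offset prediction
mask = trues(length(y))
r2_score(ŷ, y, mask)         # 0.8; the ŷ-mean variant yields ≈0.833 on this data
```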

Comment on lines +5 to +8
    is_no_nan = falses(length(first(y))) |> cfg.gdev
    for vec in y
        is_no_nan = is_no_nan .|| .!isnan.(vec)
    end

high

This logic has two significant issues:

  1. Shape Mismatch: falses(length(first(y))) creates a 1D array. If the targets are multi-dimensional (e.g., (time, batch)), the bitwise OR operation .|| will fail. Use size(first(y)) instead of length.
  2. Incorrect Masking: Computing a single global is_no_nan mask by ORing all targets is problematic. If target A has a NaN at an index where target B is valid, the global mask will be true at that index. Consequently, the loss for target A will be computed using the NaN value, resulting in a NaN total loss. Masks should be computed and applied per-target.
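The per-target approach can be sketched as follows (the `Dict` layout and the `:gpp`/`:reco` names are illustrative, not the package's data structures): each target gets its own mask, so a NaN in one target cannot poison another target's loss.

```julia
# Hypothetical sketch of per-target NaN masking.
function masked_loss(preds, targets)
    total = 0.0
    for (name, y) in targets
        mask = .!isnan.(y)                       # per-target mask, not a global OR
        total += sum(abs2, y[mask] .- preds[name][mask])  # only valid entries
    end
    return total
end

targets = Dict(:gpp => [1.0, NaN, 3.0], :reco => [NaN, 2.0, 3.0])
preds   = Dict(:gpp => [1.1, 2.0, 2.9], :reco => [0.9, 2.1, 3.2])
masked_loss(preds, targets)   # finite (≈0.07); a single OR-ed mask over both
                              # targets would pull NaN entries into the sum
```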

Comment on lines +112 to +113
    # predicto
    # predictors_forcing = unique(predictors_forcing)

medium

These lines contain an incomplete comment and commented-out code that should be removed to maintain code cleanliness.

    @warn "Note that you don't have target names."
    end
    return predictors_forcing, targets
    return predictors, forcings, targets

medium

The function now returns predictors and forcings separately, but the warning check at line 115 (visible in context) still references predictors_forcing. Since predictors_forcing is initialized as an empty array at line 89 and never populated in the new logic, this warning will be triggered on every call. The check should be updated to verify if both predictors and forcings are empty.
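A hedged sketch of the suggested fix (the function name `check_names` and the exact signature are assumed from the review context, not the package API): warn only when both new collections are empty, instead of checking the now-unpopulated `predictors_forcing`.

```julia
# Illustrative fix: the emptiness check follows the new split return values.
function check_names(predictors, forcings, targets)
    if isempty(predictors) && isempty(forcings)
        @warn "Note that you don't have predictor or forcing names."
    end
    if isempty(targets)
        @warn "Note that you don't have target names."
    end
    return predictors, forcings, targets
end
```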

Comment on lines +108 to +109
    y_t = y[target]# _get_target_y(y, target)
    ŷ_t = ŷ[target]#_get_target_ŷ(ŷ, y_t, target)

medium

There is a typo in line 109: ŷ written as y followed by U+0302 (combining circumflex) is used instead of the argument ŷ (the precomposed character U+0177) defined at line 105. While Julia normalizes identifiers to NFC, mixing these characters is confusing and can lead to issues in environments with different normalization rules. Additionally, the commented-out code should be removed.

                y_t = y[target]
                ŷ_t = ŷ[target]
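The two encodings can be checked directly (an illustrative snippet, not part of the codebase): the strings differ codepoint-for-codepoint but agree after NFC normalization, which is why the mix-up compiles yet misleads readers and byte-level tooling.

```julia
using Unicode

precomposed = "\u0177"      # ŷ as one codepoint (U+0177)
combining   = "y\u0302"     # ŷ as y plus combining circumflex (U+0302)
precomposed == combining                            # false: different codepoints
Unicode.normalize(combining, :NFC) == precomposed   # true after NFC
```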

Comment on lines +19 to +20
    ps, st = LuxCore.setup(Random.default_rng(), model) |> cfg.gdev
    ps = ps |> cfg.gdev

medium

The line ps = ps |> cfg.gdev is redundant. The parameters ps and state st are already moved to the device as part of the piped operation in line 19.

        ps, st = LuxCore.setup(Random.default_rng(), model) |> cfg.gdev
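A minimal sketch of why the second move is a no-op, using `identity` as a stand-in for `cfg.gdev` so it runs without a GPU (device functors such as those returned by `gpu_device()` are applied to the whole piped tuple, moving both elements):

```julia
# `gdev` stands in for cfg.gdev; one pipe already covers ps and st.
gdev = identity
setup = (ps = rand(2, 2), st = (;))
ps, st = (setup.ps, setup.st) |> gdev   # both "moved" in a single pipe
ps === (ps |> gdev)                     # true: re-moving ps changes nothing
```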

@lazarusA mentioned this pull request Apr 3, 2026