文档反馈

问题文档片段

问题文档片段包含公式时，显示为空格。

提交类型

issue

有点复杂...

找人问问吧。

PR

小问题，全程线上修改...

一键搞定！

请选择提交类型

问题类型

规范和低错类

- 规范和低错类：

- 错别字或拼写错误，标点符号使用错误、公式错误或显示异常。

- 链接错误、空单元格、格式错误。

- 英文中包含中文字符。

- 界面和描述不一致，但不影响操作。

- 表述不通顺，但不影响理解。

- 版本号不匹配：如软件包名称、界面版本号。

易用性

- 易用性：

- 关键步骤错误或缺失，无法指导用户完成任务。

- 缺少主要功能描述、关键词解释、必要前提条件、注意事项等。

- 描述内容存在歧义指代不明、上下文矛盾。

- 逻辑不清晰，该分类、分项、分步骤的没有给出。

正确性

- 正确性：

- 技术原理、功能、支持平台、参数类型、异常报错等描述和软件实现不一致。

- 原理图、架构图等存在错误。

- 命令、命令参数等错误。

- 代码片段错误。

- 命令无法完成对应功能。

- 界面错误，无法指导操作。

- 代码样例运行报错、运行结果不符。

风险提示

- 风险提示：

- 对重要数据或系统存在风险的操作，缺少安全提示。

内容合规

- 内容合规：

- 违反法律法规，涉及政治、领土主权等敏感词。

- 内容侵权。

请选择问题类型

问题描述

点击输入详细问题描述，以帮助我们快速定位问题。

文档反馈

使用SciAI构建神经网络

SciAI基础框架由若干基础模块构成，涵盖有神经网络搭建、训练、验证以及其他辅助函数等。

如下的示例展示了使用SciAI构建神经网络模型并进行训练的流程。

你可以在这里下载完整的样例代码： https://gitee.com/mindspore/mindscience/tree/r0.5/SciAI/tutorial

模型构建基础

使用SciAI基础框架创建神经网络的原理与使用MindSpore构建网络一致，但过程将会十分简便。

本章节以一个多层感知器为例，介绍了使用SciAI训练并求解如下方程。

f (x) = {x_{1}}^{2} + s i n (x_{2})

该部分完整代码请参考代码。

模型搭建

如下示例代码创建了一个输入维度为2，输出维度为1，包含两层维度为5的中间层的多层感知器。

from sciai.architecture import MLP
from sciai.common.initializer import XavierTruncNormal

net = MLP(layers=[2, 5, 5, 1], weight_init=XavierTruncNormal(), bias_init='zeros', activation="tanh")

MLP将默认使用正态分布随机生成网络权重，偏差bias默认为0，激活函数默认为tanh。

MLP同时接受多样化的初始化方式和MindSpore提供的所有激活函数，以及专为科学计算设计的激活函数。

损失函数定义

损失函数定义为Cell的子类，并将损失的计算方法写在方法construct中。

from mindspore import nn
from sciai.architecture import MSE

class ExampleLoss(nn.Cell):
    def __init__(self, network):
        super().__init__()
        self.network = network
        self.mse = MSE()

    def construct(self, x, y_true):
        y_predict = self.network(x)
        return self.mse(y_predict - y_true)

net_loss = ExampleLoss(net)

此时，通过直接调用net_loss，并将输入x与真实值y_true作为参数，便可计算得到当前net预测的损失。

from mindspore import Tensor

x = Tensor([[0.5, 0.5]])
y_true = Tensor([0.72942554])
print("loss value: ", net_loss(x, y_true))
# expected output
...
loss value: 0.3026065

模型训练与推理

得到损失函数后，我们即可使用SciAI框架中已封装好的训练类，使用数据集进行训练。在本案例中，我们对方程进行随机采样，生成数据集x_train与y_true进行训练。

模型训练部分代码如下所示，其中主要展示了SciAI若干功能。模型训练类TrainCellWithCallBack，其与MindSpore.nn.TrainOneStepCell功能基本一致，需要提供网络net_loss与优化器作为参数，并为科学计算功能增加了回调功能。回调包括打印训练loss值、训练时间、自动保存ckpt文件。如下的案例代码将会每100个训练周期打印loss值与训练时间，并在每1000个训练周期保存当前模型参数为ckpt文件。 SciAI提供to_tensor工具，可以方便地将多个numpy数据同时转换为Tensor类型。使用log_config指定目标目录，用于自动保存TrainCellWithCallBack的回调打印，以及用户使用print_log所打印的内容。

import numpy as np
from mindspore import nn
from sciai.common import TrainCellWithCallBack
from sciai.context import init_project
from sciai.utils import to_tensor, print_log, log_config

# Get the correct platform automatically and set to GRAPH_MODE by default.
init_project()
# Auto log saving
log_config("./logs")

def func(x):
    """The function to be learned to"""
    return x[:, 0:1] ** 2 + np.sin(x[:, 1:2])

optimizer = nn.Adam(net_loss.trainable_params())
trainer = TrainCellWithCallBack(net_loss, optimizer, loss_interval=100, time_interval=100, ckpt_interval=1000)
x_train = np.random.rand(1000, 2)
# Randomly collect ground truth
y_true = func(x_train)
# Convert to Tensor data type
x_train, y_true = to_tensor((x_train, y_true))
for _ in range(10001):
    trainer(x_train, y_true)
print_log("Finished")

预期运行结果如下。

python ./example_net.py
# expected output
...
step: 0, loss: 0.5189553, interval: 2.7039313316345215s, total: 2.7039313316345215s
step: 100, loss: 0.080132075, interval: 0.11984062194824219s, total: 2.8237719535827637s
step: 200, loss: 0.055663396, interval: 0.09104156494140625s, total: 2.91481351852417s
step: 300, loss: 0.032194577, interval: 0.09095025062561035s, total: 3.0057637691497803s
step: 400, loss: 0.015914217, interval: 0.09099435806274414s, total: 3.0967581272125244s
...
Finished

在训练结束并且损失收敛时，通过调用y = net(x)即可得到x处的预测值y。继续随机采样若干位置x_val用于验证。

x_val = np.random.rand(5, 2)
y_true = func(x_val)
y_pred = net(to_tensor(x_val)).asnumpy()
print_log("y_true:")
print_log(y_true)
print_log("y_pred:")
print_log(y_pred)

预期运行结果如下。经过训练，模型的预测值接近数值计算结果。

# expected output
y_true:
[[0.34606973]
 [0.70457536]
 [0.90531053]
 [0.84420218]
 [0.48239506]]
y_pred:
[[0.34271246]
 [0.70356864]
 [0.89893466]
 [0.8393946 ]
 [0.47805673]]

模型构建拓展

使用SciAI可以求解更为复杂的问题，例如物理驱动的神经网络（PINN）。该章节继续以一个多层感知器为例，介绍使用SciAI训练并求解如下偏微分方程。

\frac{\partial f}{\partial x} - 2 \frac{f}{x} + x^{2} y^{2} = 0

边界条件定义如下。

f (0) = 0, f (1) = 1

在此边界条件下，函数的解析解为：

f (x) = \frac{x^{2}}{0.2 x^{5} + 0.8}

该部分完整代码请参考代码。

损失函数定义

与上一章中损失函数定义基本一致，需要定义损失为Cell的子类。

不同的是在该损失函数中，需要计算原函数的偏导。 SciAI为此提供了便捷的工具operators.grad，通过设置网络输入与输出的索引，可以计算某个输入对某个输出的偏导值。在该问题中，输入输出维度均为1，因此设置input_index与output_index为0。

from mindspore import nn, ops
from sciai.architecture import MSE, MLP
from sciai.operators import grad

class ExampleLoss(nn.Cell):
    """ Loss definition class"""
    def __init__(self, network):
        super().__init__()
        self.network = network
        self.dy_dx = grad(net=self.network, output_index=0, input_index=0)  # partial differential definition
        self.mse = MSE()

    def construct(self, x, x_bc, y_bc_true):
        y = self.network(x)
        dy_dx = self.dy_dx(x)
        domain_res = dy_dx - 2 * ops.div(y, x) + ops.mul(ops.pow(x, 2), ops.pow(y, 2))  # PDE residual error

        y_bc = self.network(x_bc)
        bc_res = y_bc_true - y_bc  # Boundary conditions residual
        return self.mse(domain_res) + 10 * self.mse(bc_res)

模型训练与推理

通过终端执行脚本文件，执行训练与推理，得到如下预期结果。最终预测值y_pred与真实值y_true基本接近。

python ./example_grad_net.py
# expected output
...
step: 0, loss: 3.1961572, interval: 3.117840051651001s, total: 3.117840051651001s
step: 100, loss: 1.0862937, interval: 0.23533344268798828s, total: 3.353173494338989s
step: 200, loss: 0.7334847, interval: 0.21307134628295898s, total: 3.566244840621948s
step: 300, loss: 0.5629723, interval: 0.19696831703186035s, total: 3.763213157653809s
step: 400, loss: 0.4133342, interval: 0.20153212547302246s, total: 3.964745283126831s
...
Finished
y_true:
[[0.02245186]
 [0.99459697]
 [0.04027248]
 [0.12594332]
 [0.39779923]]
y_pred:
[[0.02293926]
 [0.99337316]
 [0.03924912]
 [0.12166673]
 [0.4006738 ]]