AI写算法：ARIMA

05-14 1066阅读

当使用Python进行ARIMA（自回归积分滑动平均模型）建模时，通常会使用statsmodels库。以下是一个简单的ARIMA模型的Python代码示例：

（图片来源网络，侵删）

python复制代码

	import pandas as pd
	from statsmodels.tsa.arima.model import ARIMA
	from statsmodels.tsa.stattools import adfuller
	import matplotlib.pyplot as plt

	# 假设你有一个名为'data.csv'的CSV文件，其中包含你要分析的时间序列数据
	# 数据应该有一个名为'date'的日期列和一个名为'value'的值列

	# 读取数据
	df = pd.read_csv('data.csv', parse_dates=['date'], index_col='date')

	# 确保数据是数值型的
	df['value'] = pd.to_numeric(df['value'], errors='coerce')

	# 检查数据是否有缺失值，并删除它们（如果有的话）
	df = df.dropna()

	# 检查数据的平稳性（可选）
	# 使用ADF检验（Augmented Dickey-Fuller test）
	result = adfuller(df['value'])
	print('ADF Statistic: %f' % result[0])
	print('p-value: %f' % result[1])

	# 如果数据不是平稳的，你可能需要进行差分或其他转换
	# 但为了简单起见，我们假设数据已经是平稳的

	# 拟合ARIMA模型
	# 这里我们使用ARIMA(1, 1, 0)作为示例，但你应该根据数据选择合适的p, d, q值
	model = ARIMA(df['value'], order=(1, 1, 0))
	model_fit = model.fit(disp=0)

	# 输出模型摘要
	print(model_fit.summary())

	# 预测未来几个时间点的值（例如，预测接下来的10个值）
	n_steps = 10
	forecast, stderr, conf_int = model_fit.forecast(steps=n_steps)

	# 绘制原始数据和预测数据
	plt.figure(figsize=(10, 5))
	plt.plot(df['value'], label='Original')
	plt.plot(range(len(df['value']), len(df['value']) + n_steps), forecast, color='red', label='Forecast')
	plt.fill_between(range(len(df['value']), len(df['value']) + n_steps),
	conf_int[:, 0],
	conf_int[:, 1], color='m', alpha=.15)
	plt.title('ARIMA Model Forecast')
	plt.xlabel('Time')
	plt.ylabel('Value')
	plt.legend()
	plt.show()