I'm struggling with datetime formats in pandas. My dataset looks like this (the dates are stored as dtype object):
+--------+------------+----------+---------------------------+---------------------+
| | event_id_x | payback | event_starts_utc_datetime | dtScraped |
+--------+------------+----------+---------------------------+---------------------+
| 80325 | 1004179030 | 0.980840 | 2017-09-13 20:45:03.888 | 2017-09-06 17:06:32 |
| 104592 | 1004179030 | 0.980840 | 2017-09-13 20:45:03.888 | 2017-09-06 19:23:56 |
| 261304 | 1004179030 | 0.980840 | 2017-09-13 20:45:03.888 | 2017-09-07 06:21:47 |
| 657433 | 1004179030 | 0.980840 | 2017-09-13 20:45:03.888 | 2017-09-08 13:06:05 |
| 661013 | 1004179030 | 0.979975 | 2017-09-13 20:45:03.888 | 2017-09-11 09:04:15 |
+--------+------------+----------+---------------------------+---------------------+
I want to convert event_starts_utc_datetime and dtScraped to datetime, but the following raises "ValueError: time data 'event_starts_utc_datetime' does not match format specified":
pinny_payback["event_starts_utc_datetime"] = pd.to_datetime(["event_starts_utc_datetime"], format='%Y-%m-%d %H:%M:%S.%f')
Can you help?
Here is some code to create a dummy df. I modified your data to use commas so it can be read with pd.read_clipboard.
"""
id,event_id_x,payback,event_starts_utc_datetime,dtScraped
80325,1004179030,0.980840,2017-09-13 20:45:03.888,2017-09-06 17:06:32
104592,1004179030,0.980840,2017-09-13 20:45:03.888,2017-09-06 19:23:56
261304,1004179030,0.980840,2017-09-13 20:45:03.888,2017-09-07 06:21:47
657433,1004179030,0.980840,2017-09-13 20:45:03.888,2017-09-08 13:06:05
661013,1004179030,0.979975,2017-09-13 20:45:03.888,2017-09-11 09:04:15
"""
import pandas as pd
df = pd.read_clipboard(sep = ',')
df['event_starts_utc_datetime'] = pd.to_datetime(df['event_starts_utc_datetime'], format='%Y-%m-%d %H:%M:%S.%f')
print(df.dtypes)
df
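I suspect your error is simply that the DataFrame name is missing from your pd.to_datetime() call. You passed the list ["event_starts_utc_datetime"], so pandas tried to parse the column name itself as a date instead of the values in that column.
Output: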
id int64
event_id_x int64
payback float64
event_starts_utc_datetime datetime64[ns]
dtScraped object
dtype: object
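As the output shows, dtScraped is still object, and the question asked for both columns. The following is a minimal sketch that reproduces the original error and then converts both columns. It makes two assumptions that are not in the post: it reads the sample rows from a string with io.StringIO instead of pd.read_clipboard (so it runs without a clipboard), and it guesses the dtScraped format '%Y-%m-%d %H:%M:%S' from the five sample rows.

```python
import io

import pandas as pd

# Sample rows copied from the question. Reading them via StringIO is an
# assumption made here for reproducibility; the answer used pd.read_clipboard.
CSV_TEXT = """id,event_id_x,payback,event_starts_utc_datetime,dtScraped
80325,1004179030,0.980840,2017-09-13 20:45:03.888,2017-09-06 17:06:32
104592,1004179030,0.980840,2017-09-13 20:45:03.888,2017-09-06 19:23:56
261304,1004179030,0.980840,2017-09-13 20:45:03.888,2017-09-07 06:21:47
657433,1004179030,0.980840,2017-09-13 20:45:03.888,2017-09-08 13:06:05
661013,1004179030,0.979975,2017-09-13 20:45:03.888,2017-09-11 09:04:15
"""

df = pd.read_csv(io.StringIO(CSV_TEXT))

# Reproduce the original mistake: the list holds the column *name*,
# so pandas tries to parse that literal string as a timestamp.
try:
    pd.to_datetime(["event_starts_utc_datetime"], format="%Y-%m-%d %H:%M:%S.%f")
except ValueError as exc:
    print("ValueError:", exc)

# Pass the actual columns instead. Each column gets its own format,
# because only event_starts_utc_datetime carries fractional seconds.
df["event_starts_utc_datetime"] = pd.to_datetime(
    df["event_starts_utc_datetime"], format="%Y-%m-%d %H:%M:%S.%f"
)
df["dtScraped"] = pd.to_datetime(
    df["dtScraped"], format="%Y-%m-%d %H:%M:%S"  # format assumed from the sample
)

print(df.dtypes)
```

If the real data mixes formats within a column, an explicit format will raise on the first mismatching row, which is usually preferable to silently misparsed dates.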