|
|
第一步:(拿全部经书编号)
帮助文档:https://cbdata.dila.edu.tw/stable/static_pages/export#all_works
经书编号,标题,都是可以可以和这里一一对应的:
线上阅读地址:https://cbetaonline.dila.edu.tw/zh/
全部佛典 卷列表(可以拿到所有经书的编号[work]的值,可以为后面的经书详情页链接进行拼接)
API:https://cbdata.dila.edu.tw/stable/export/all_works
返回数据格式:
如果juans参数数组存在多个键值的,代表有分卷,需要继续循环拼接
- {"work": "T0001", "title": "長阿含經", "juans": ["1", "2", "3", "4", "5", "6", "7", "8", "9", "10", "11", "12", "13", "14", "15", "16", "17", "18", "19", "20", "21", "22"]},
- {"work": "T0002", "title": "七佛經", "juans": ["1"]},
- {"work": "T0003", "title": "毘婆尸佛經", "juans": ["1", "2"]},
- {"work": "T0004", "title": "七佛父母姓字經", "juans": ["1"]},
- {"work": "T0005", "title": "佛般泥洹經", "juans": ["1", "2"]},
- {"work": "T0006", "title": "般泥洹經", "juans": ["1", "2"]},
- {"work": "T0007", "title": "大般涅槃經", "juans": ["1", "2", "3"]},
- {"work": "T0008", "title": "大堅固婆羅門緣起經", "juans": ["1", "2"]},
- {"work": "T0009", "title": "人仙經", "juans": ["1"]},
- {"work": "T0010", "title": "白衣金幢二婆羅門緣起經", "juans": ["1", "2", "3"]},
- {"work": "T0159", "title": "大乘本生心地觀經", "juans": ["1", "2", "3", "4", "5", "6", "7", "8"]}
复制代码
第二步:(根据经书编号拿每一步经书内容)
取得某卷 HTML
帮助文档:https://cbdata.dila.edu.tw/stable/static_pages/get_html
API:https://cbdata.dila.edu.tw/stabl ... &juan=1&work_info=1
第三步:(过滤经书内容)
拿到经文详情之后,需要各种正则过滤html的,需要参考HTML for UI 格式說明
帮助文档:https://cbdata.dila.edu.tw/stable/static_pages/html_for_ui
....后续数据逻辑继续处理,如:入库
|
|