https://github.com/docling-project/docling/blob/main/docling/backend/msword_backend.py
https://www.zhihu.com/question/1961862463881451508/answer/1961873238784184663
https://docs.llamaindex.org.cn/en/stable/examples/retrievers/auto_merging_retriever/