Pro 6.3.13 elasticsearch not indexing pdf-content

Hi,
I am struggling with elasticsearch indexing, happening on a dedicated background node (cluster + mysql 8, ceph backend). Logs look ok -> no errors. Manually deleteing and updating does not solve anything. Log suggests that files are being extracted and added to the index correctly but I can only find filenames not their content:

root@tbgnode:/data/seafile/seafile-pro-server-6.3.13# pro/pro.py search --clear
Delete seafile search index ([y]/n)? y

Delete search index, this may take a while…

/data/seafile/seafile-pro-server-6.3.13/seahub/thirdpart/requests/init.py:83: RequestsDependencyWarning: Old version of cryptography ([1, 2, 3]) may cause slowdown.
warnings.warn(warning, RequestsDependencyWarning)
08/20/2020 14:15:53 [INFO] root:210 main: storage: using Ceph storage backend
08/20/2020 14:15:53 [INFO] root:212 main: index office pdf: True
08/20/2020 14:15:53 [WARNING] seafes:176 delete_indices: deleting index repo_head
08/20/2020 14:15:53 [WARNING] seafes:176 delete_indices: deleting index repofiles
root@tbgnode:/data/seafile/seafile-pro-server-6.3.13# pro/pro.py search --update

Updating search index, this may take a while…

/data/seafile/seafile-pro-server-6.3.13/seahub/thirdpart/requests/init.py:83: RequestsDependencyWarning: Old version of cryptography ([1, 2, 3]) may cause slowdown.
warnings.warn(warning, RequestsDependencyWarning)
08/20/2020 14:15:58 [INFO] root:210 main: storage: using Ceph storage backend
08/20/2020 14:15:58 [INFO] root:212 main: index office pdf: True
08/20/2020 14:15:58 [INFO] seafes:163 start_index_local: Index process initialized.
08/20/2020 14:15:58 [INFO] seafes:46 run: starting worker0 worker threads for indexing
08/20/2020 14:15:58 [INFO] seafes:46 run: starting worker1 worker threads for indexing
/data/seafile/seafile-pro-server-6.3.13/pro/python/SQLAlchemy-1.1.3-py2.6-linux-x86_64.egg/sqlalchemy/engine/default.py:462: Warning: ‘utf8’ is currently an alias for the character set UTF8MB3, but will be an alias for UTF8MB4 in a future release. Please consider using UTF8MB4 in order to be unambiguous.
cursor.execute(statement, parameters)
08/20/2020 14:16:01 [INFO] seafes:71 update_repo: Updating repo 9f014d9f-83e6-4d88-a682-97a2a598ecd8
08/20/2020 14:16:01 [DEBUG] seafes:73 update_repo: latest_commit_id: 67c2a6036231f3c5390b9dbd82f9263c288796c8, status.from_commit: None
08/20/2020 14:16:01 [INFO] seafes:71 update_repo: Updating repo a0decb27-3cea-4ecd-b3f1-e24509f2d30f
08/20/2020 14:16:01 [DEBUG] seafes:73 update_repo: latest_commit_id: bec5a84854deecf8f52212737a8351b309e6403e, status.from_commit: None
08/20/2020 14:16:01 [DEBUG] seafes.extract:176 extract: extracting a0decb27-3cea-4ecd-b3f1-e24509f2d30f /test.md…
08/20/2020 14:16:01 [DEBUG] seafes.extract:178 extract: successfully extracted /test.md
08/20/2020 14:16:01 [DEBUG] seafes.extract:176 extract: extracting 9f014d9f-83e6-4d88-a682-97a2a598ecd8 /seafile-tutorial.doc…
08/20/2020 14:16:01 [DEBUG] seafes.extract:176 extract: extracting a0decb27-3cea-4ecd-b3f1-e24509f2d30f /1234.pdf…
08/20/2020 14:16:01 [DEBUG] seafes.extract:178 extract: successfully extracted /1234.pdf
08/20/2020 14:16:01 [DEBUG] seafes.extract:176 extract: extracting a0decb27-3cea-4ecd-b3f1-e24509f2d30f /seafile-tutorial.doc…
08/20/2020 14:16:01 [DEBUG] seafes.extract:178 extract: successfully extracted /seafile-tutorial.doc
08/20/2020 14:16:01 [DEBUG] seafes.extract:178 extract: successfully extracted /seafile-tutorial.doc
08/20/2020 14:16:01 [INFO] seafes:71 update_repo: Updating repo 9033b985-6125-4319-aa9d-9104205c9254
08/20/2020 14:16:01 [DEBUG] seafes:73 update_repo: latest_commit_id: 3b2452f50c1b9b69458391f295da178a53069e38, status.from_commit: None
08/20/2020 14:16:02 [DEBUG] seafes.extract:176 extract: extracting 9033b985-6125-4319-aa9d-9104205c9254 /seafile-tutorial.doc…
08/20/2020 14:16:02 [DEBUG] seafes:96 thread_task: Queue is empty, worker0 worker threads stop
08/20/2020 14:16:02 [INFO] seafes:124 thread_task: worker0 worker updated at 2020-08-20 14:16 time
08/20/2020 14:16:02 [INFO] seafes:129 thread_task: worker0 worker get 0 error
08/20/2020 14:16:02 [DEBUG] seafes.extract:178 extract: successfully extracted /seafile-tutorial.doc
08/20/2020 14:16:02 [DEBUG] seafes:96 thread_task: Queue is empty, worker1 worker threads stop
08/20/2020 14:16:02 [INFO] seafes:124 thread_task: worker1 worker updated at 2020-08-20 14:16 time
08/20/2020 14:16:02 [INFO] seafes:129 thread_task: worker1 worker get 0 error
08/20/2020 14:16:02 [INFO] seafes:38 clear_worker: All worker threads has stopped.
08/20/2020 14:16:02 [INFO] seafes:72 run: index updated, total time 4.10979890823 seconds
08/20/2020 14:16:02 [INFO] seafes:133 clear_deleted_repo: start to clear deleted repo
08/20/2020 14:16:03 [INFO] seafes:137 clear_deleted_repo: 0 repos need to be deleted.
08/20/2020 14:16:03 [INFO] seafes:141 clear_deleted_repo: deleted repo has been cleared
08/20/2020 14:16:03 [INFO] seafes:166 start_index_local:

Index updated, statistic report:

08/20/2020 14:16:03 [INFO] seafes:167 start_index_local: [commit read] 3
08/20/2020 14:16:03 [INFO] seafes:168 start_index_local: [dir read] 3
08/20/2020 14:16:03 [INFO] seafes:169 start_index_local: [file read] 5
08/20/2020 14:16:03 [INFO] seafes:170 start_index_local: [block read] 5

Anyone else having this issue? Thanks in advance!

You need to opt-in into full text search for pdfs.