- 15 Apr, 2021 1 commit
-
-
Guillaume Abrioux authored
When collocating daemons, if we chown all files under `/var/lib/ceph` it can cause issues for the collocated daemons that wouldn't have been migrated yet. This commit makes the playbook chown only the files corresponding to the daemon being migrated. Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit ddbc11c4)
-
- 12 Apr, 2021 1 commit
-
-
Guillaume Abrioux authored
this adds a small documentation in the header of the playbook in order to explain what is the goal of this playbook. Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit 36b4227d)
-
- 09 Apr, 2021 1 commit
-
-
Guillaume Abrioux authored
8a174333 introduced a regression for containerized deployments. When deploying with `osd_auto_discovery: false` the osds never got activated. This commit fixes it. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1947072 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
- 02 Feb, 2021 2 commits
-
-
Guillaume Abrioux authored
This commit adds the missing `--ulimit nofile=1024:4096` on this task. Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
Guillaume Abrioux authored
The task in charge of activating new osd is skipped because `devices` is populated with the devices being added only. It means it doesn't enter the condition since the length of `devices` is less than the number of activated osds. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1906293 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
- 29 Jan, 2021 1 commit
-
-
Dimitri Savineau authored
We already do that in the other systemd templates (mgr, mds, etc..) and would present to add workaround in other orchestration tool. This change is for containerized deployment only. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1882724 Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit 3749d297)
-
- 23 Nov, 2020 1 commit
-
-
Guillaume Abrioux authored
This commit changes the bind mount option for the mount point `/var/lib/ceph` in the systemd template for mon and mgr containers. This is needed in case of collocating mon/mgr with osds using dmcrypt scenario. Once mon/mgr got converted to containers, the dmcrypt layer sub mount is still seen in `/var/lib/ceph`. For some reason it makes the corresponding devices busy so any other container can't open/close it. As a result, it prevents osds from starting properly. Since it only happens on the nodes converted before the OSD play, the idea is to bind mount `/var/lib/ceph` on mon and mgr with the `rshared` option so once the sub mount is unmounted, it is propagated inside the container so it doesn't see that mount point. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1896392 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit f5ba6d9b)
-
- 16 Nov, 2020 1 commit
-
-
Dimitri Savineau authored
fa2bb3af only fix the symlink owner/group issue in the OSD play. If the OSDs are collocated with other services like MONs and MGRs then the chown command will fail. $ find /var/lib/ceph/osd/ceph-0 -not -user 167 -execdir chown 167:167 {} + chown: cannot dereference './block': Permission denied Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1896448 Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit 35ed9977)
-
- 13 Oct, 2020 1 commit
-
-
Guillaume Abrioux authored
when using `monitor_interface`, if nodes don't have same interface names this task will fail like following: ``` fatal: [argo010]: FAILED! => { "msg": "The task includes an option with an undefined variable. The error was: 'ansible.vars.hostvars.HostVarsVars object' has no attribute u'ansible_enp1s0f0'\n\nThe error appears to have been in '/usr/share/ceph-ansible/roles/ceph-mon/tasks/docker/main.yml': line 19, column 3, but may\nbe elsewhere in the file depending on the exact syntax problem.\n\nThe offending line appears to be:\n\n\n- name: ipv4 - force peer addition as potential bootstrap peer for cluster bringup - monitor_interface\n ^ here\n" } ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1876551 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
- 07 Oct, 2020 2 commits
-
-
Guillaume Abrioux authored
This adds some missing param to the container cli calls in ceph-osd-run.sh.j2 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1885558 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
Dimitri Savineau authored
90f3f615 introduced the docker-to-podman.yml playbook but the ceph-osd-run.sh.j2 template still has some docker hardcoded instead of using the container_binary variable. Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com>
-
- 29 Sep, 2020 1 commit
-
-
Dimitri Savineau authored
We don't need to show this information during the module execution. Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit a3f4e2b4)
-
- 25 Sep, 2020 1 commit
-
-
Kefu Chai authored
Fixes #5798 Signed-off-by:
Kefu Chai <tchaikov@gmail.com> (cherry picked from commit f3a78371)
-
- 21 Sep, 2020 1 commit
-
-
Dimitri Savineau authored
There's no need to set this fact with a set_fact We can achieve this in ceph-defaults Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1875058 Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com>
-
- 17 Sep, 2020 2 commits
-
-
Dimitri Savineau authored
If the OSD directory is using symlinks for referencing devices (like block, db, wal for bluestore and journal for filestore) then the chown command could fail to change the owner:group on some system. $ ls -hl /var/lib/ceph/osd/ceph-0/ total 28K lrwxrwxrwx 1 ceph ceph 92 Sep 15 01:53 block -> /dev/ceph-45113532-95ca-471b-bd75-51de46f1339c/osd-data-570a1aee-60c0-44c9-8036-ffed7d67a4e6 -rw------- 1 ceph ceph 37 Sep 15 01:53 ceph_fsid -rw------- 1 ceph ceph 37 Sep 15 01:53 fsid -rw------- 1 ceph ceph 55 Sep 15 01:53 keyring -rw------- 1 ceph ceph 6 Sep 15 01:53 ready -rw------- 1 ceph ceph 3 Sep 15 02:00 require_osd_release -rw------- 1 ceph ceph 10 Sep 15 01:53 type -rw------- 1 ceph ceph 2 Sep 15 01:53 whoami $ find /var/lib/ceph/osd/ceph-0 -not -user 167 -execdir chown 167:167 {} + chown: cannot dereference './block': Permission denied $ find /var/lib/ceph/osd/ceph-0 -not -user 167 /var/lib/ceph/osd/ceph-0/block Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit da4280e2)
-
Dimitri Savineau authored
When running the switch2container playbook on a Debian based system then the systemd unit path isn't the same than Red Hat based system. Because the systemd unit files aren't removed then the new container systemd unit isn't take in count. Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit c1af69a7)
-
- 10 Sep, 2020 1 commit
-
-
Guillaume Abrioux authored
in order to avoid docker.io rate limiting Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit 2001039c)
-
- 21 Aug, 2020 1 commit
-
-
RPietrzak authored
This together with condition 'ansible_play_hosts_all | last' causes skipping that task on the first host. Signed-off-by:
RPietrzak <rp.pietrzak@gmail.com>
-
- 20 Aug, 2020 2 commits
-
-
Guillaume Abrioux authored
remove same node for containerized deployments Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
Guillaume Abrioux authored
This node was needed for the upgrade job in stable-4.0. Since we moved the code erasure pool testing in lvm_osds, we don't need to fire up that node anymore. Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
- 19 Aug, 2020 1 commit
-
-
Guillaume Abrioux authored
This commit moves the systemd rendering task into `systemd.yml` file. Otherwise, when running docker to podman playbook, the systemd unit file isn't updated as it should be. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1870141 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
- 05 Aug, 2020 1 commit
-
-
Guillaume Abrioux authored
This commit makes the bindmount a bit more generic, otherwise it currently makes the OSDs failing to start in an OSP FFU upgrade (with RHEL7 > RHEL8 OS upgrade). docker2podman playbook is run from ceph-ansible stable-3.2 branch against RHEL7 nodes where `/var/run/lvmetad.socket` exists but once the system is upgraded to RHEL8, this socket doesn't exist anymore and prevent OSDs from starting after the reboot. As a workaround we can make this bindmount a bit more generic like what is done in `stable-4.0` branch by mounting `/run/lvm` instead. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1866252 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
- 30 Jul, 2020 1 commit
-
-
Dimitri Savineau authored
When using non lvm scenarios (collocated or non-collocated) then the disk_list variable isn't set because this is done during the ceph-osd role (start_osds.yml) which isn't executed in the docker2podman playbook. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1862046 Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com>
-
- 29 Jul, 2020 1 commit
-
-
Dimitri Savineau authored
The pytest-forked 1.3.0 release isn't compatible with the pytest release we are using in that branch. ----------------------- pytest-forked 1.3.0 requires pytest>=3.10, but you'll have pytest 3.6.1 which is incompatible. ----------------------- Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com>
-
- 07 Jul, 2020 4 commits
-
-
Dimitri Savineau authored
The automatic backport [1] done by mergify has merged the backport PR even if a conflict was present in the documentation. [1] https://github.com/ceph/ceph-ansible/pull/3803 Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com>
-
Dimitri Savineau authored
By default, ansible gathers facts from facter and ohai if installed on the remote nodes, given we don't need them, let's exclude these facts from our facts gathering Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit c95adc56)
-
Dimitri Savineau authored
When using collocated or non-collocated osd_scenarios (ceph-disk) and trying to deterime the OSD_DEVICE from the OSD_ID passed to the systemd unit then we can be in a situation where the OSD hasn't been activated but the OSD ID exists. This means the data partition isn't in activate state and the ceph-disk list command won't show the OSD ID on the data partition. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1850377 Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com>
-
Guillaume Abrioux authored
This isn't backported from master because there are too many changes between stable-3.2 and other newer branches. NOTE: This playbook *doesn't* add podman support in stable-3.2 at all. This is a tripleO dedicated playbook which is intended to be run early during FFU workflow in order to prepare the OS upgrade. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1853457 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-
- 03 Jul, 2020 3 commits
-
-
Guillaume Abrioux authored
This commit adds a note about `stable-3.0` `stable-3.1` branches which are deprecated and not maintained anymore. Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit bbe30bcc)
-
Guillaume Abrioux authored
This commit updates the documentation to add a note about containerized deployments. Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit e6148850)
-
Guillaume Abrioux authored
Typical error: ``` Warning, treated as error: /home/jenkins-build/build/workspace/ceph-ansible-docs-pull-requests/docs/source/day-2/upgrade.rst:2:Title underline too short. ``` Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit 5c254861)
-
- 29 Jun, 2020 2 commits
-
-
Guillaume Abrioux authored
We shouldn't set this flag when running switch_to_containers playbook. Otherwise the playbook fails waiting for pgs to be clean. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1843569 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit b91d60d3)
-
Guillaume Abrioux authored
The workflow in this playbook should be the same than in rolling_update, we should first set noout and nodeep-scrub flags before migrating the first osd and unset osd flags after the last osd is migrated. Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit 2cfaa056)
-
- 25 Jun, 2020 2 commits
-
-
Guillaume Abrioux authored
This reverts commit 5a413409. We need to provide a tag for RHCS 3.3z6 without this commit.
-
Guillaume Abrioux authored
This reverts commit b7ec4a99. We need to provide a tag for RHCS 3.3z6 without this commit.
-
- 23 Jun, 2020 1 commit
-
-
Dimitri Savineau authored
When using docker container engine then the systemd unit scripts only use a dependency on the docker daemon via the After parameter. But if docker is restarted on a live system then the ceph systemd units should wait for the docker daemon to be fully restarted. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1846830 Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit bd22f1d1)
-
- 18 Jun, 2020 3 commits
-
-
Dimitri Savineau authored
This commit adds a chapter about the ceph upgrade process. Closes: #5393 Signed-off-by:
Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit e41487db)
-
Guillaume Abrioux authored
We shouldn't set this flag when running switch_to_containers playbook. Otherwise the playbook fails waiting for pgs to be clean. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1843569 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit b91d60d3)
-
Guillaume Abrioux authored
The workflow in this playbook should be the same than in rolling_update, we should first set noout and nodeep-scrub flags before migrating the first osd and unset osd flags after the last osd is migrated. Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit 2cfaa056)
-
- 27 Apr, 2020 1 commit
-
-
Guillaume Abrioux authored
This commit moves the dummy container creation task right before the cephx keys creation task so it can't be run out of time. Also, this commit makes the dummy container running for ever. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1828105 Signed-off-by:
Guillaume Abrioux <gabrioux@redhat.com>
-