Here is how to limit the size of /var/log/pods on K3s. Create /etc/rancher/k3s/config.yaml (or edit it, if it already exists) and add the following kubelet arguments:
kubelet-arg:
- "container-log-max-files=2"
- "container-log-max-size=2Mi"
This will cause k3s to keep at most 2 log files per container, each with a maximum size of 2 MiB.
Alternatively, the arguments can be passed to the k3s binary itself.
k3s server --kubelet-arg container-log-max-files=4 --kubelet-arg container-log-max-size=50Mi
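After restarting the k3s service, it is easy to confirm the limits are honored by checking the per-container log directories; no file should grow past the configured maximum before being rotated. A quick look (the exact pod directories will differ):
[root@k3sm01 ~]# ls -lhR /var/log/pods | head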
[root@k3sm01 ~]# kubectl get apiservices
v1.k3s.cattle.io Local True 15m
v1beta1.metrics.k8s.io kube-system/metrics-server False (MissingEndpoints) 15m
v1.crd.projectcalico.org Local True 6m32s
I found a potential solution which, unfortunately, did not help. Nevertheless, I needed to try to modify the metrics-server deployment. Specifically, I needed to add the --kubelet-insecure-tls option. It turns out this can be done using the kubectl patch command:
[root@k3sm01 ~]# kubectl patch deployment metrics-server -n kube-system --type='json' -p='[{"op": "add", "path": "/spec/template/spec/containers/0/args/-", "value": "--kubelet-insecure-tls"}]'
The path parameter is derived from components.yaml of metrics-server.
One can always do a re-deployment after editing the appropriate file.
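For instance, the deployment can be edited in place, which triggers a new rollout:
[root@k3sm01 ~]# kubectl edit deployment metrics-server -n kube-system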
Mar 17 07:58:23 prx013 NetworkManager[1154]: <warn> [1679036303.3521] platform-linux: do-add-ip6-address[2: fe80::250:56ff:feb9:dc71]: failure 95 (Operation not supported)
Mar 17 07:58:25 prx013 NetworkManager[1154]: <warn> [1679036305.3543] ipv6ll[40b05df140eccb36,ifindex=2]: changed: no IPv6 link local address to retry after Duplicate Address Detection failures (back off)
Oh yes, NetworkManager! Thankfully, the following stops the madness:
[root@prx013 ~]# nmcli device modify ens224 ipv6.method "disabled"
[root@prx013 ~]#
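Note that nmcli device modify only changes the runtime state of the device. To make the change survive reboots, modify the connection profile as well (assuming the profile shares the name of the device):
[root@prx013 ~]# nmcli connection modify ens224 ipv6.method "disabled"
[root@prx013 ~]# nmcli connection up ens224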
Needless to say, NetworkManager is not my favorite tool.
[root@build01 ~]# dmesg | grep overflow
[ 0.289646] audit: kauditd hold queue overflow
[ 0.364107] audit: kauditd hold queue overflow
Apparently, there is a backlog limit for audit messages. This limit specifies the queue size for unprocessed events intended for auditd. In this particular case, the limit was too low. This could be fixed by turning off auditd, but then there is most likely a reason why the daemon is on in the first place. Alternatively, the backlog queue limit can be increased.
To do so, in /etc/default/grub edit the line starting with GRUB_CMDLINE_LINUX…:
...
GRUB_DISABLE_SUBMENU=true
GRUB_TERMINAL_OUTPUT="console"
GRUB_CMDLINE_LINUX="audit=1 audit_backlog_limit=8192 ipv6.disable=1 crashkernel=auto resume=/dev/mapper/system-swap rd.lvm.lv=system/root rd.lvm.lv=system/swap rhgb quiet"
GRUB_DISABLE_RECOVERY="true"
...
… and add audit_backlog_limit=8192, thus forcing the new hold queue size. After that, the GRUB configuration needs to be rebuilt:
[root@build01 ~]# grub2-mkconfig -o /boot/grub2/grub.cfg
[root@build01 ~]#
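Since audit_backlog_limit is a kernel command line parameter, the new value takes effect after a reboot. The current status, including the backlog limit, can be checked with auditctl -s; the limit can even be raised at runtime:
[root@build01 ~]# auditctl -s
[root@build01 ~]# auditctl -b 8192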
That should do it.
Now the details. After performing the vSphere installation and booting the system, the UEFI boot menu would contain a “VMware ESXi” entry. Selecting the entry would result in no response. VMware has an article suggesting a possible solution, which did not work in my case. All attempts to manually add a new boot entry in BIOS resulted in a “File system not found” error.
So, I thought maybe the filesystem on the EFI partition was not clean. I booted the machine using a USB stick and ran fsck on the partition:
root@mint:~# fsck /dev/sda1
fsck from util-linux 2.31.1
fsck.fat 4.1 (2017-01-24)
/dev/sda1: 11 files, 348/51091 clusters
root@mint:~#
The filesystem was clean, yet the machine failed to boot. After some searching, I found a similar problem. Again, I tried the filesystem check, this time using FreeBSD:
# fsck_msdosfs /dev/da0s1
** /dev/da0s1
** Phase 1 - Read FAT and checking connectivity
** Phase 2 - Checking Directories
** Phase 3 - Checking for Lost Files
Next free cluster in FSInfo block (2) not free
Fix? [yn] y
4 files. 31MiB free (63781 clusters)
#
It seemed the filesystem check fixed an issue. Still, the machine would not boot. The only thing I had not tried at this point was to recreate the FAT filesystem on the vSphere EFI partition…
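For reference, the two mount points used below have to exist beforehand:
root@mint:~# mkdir -p /tmp/EFIMNT /tmp/EFIBKP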
root@mint:~# mount /dev/sda1 /tmp/EFIMNT/
root@mint:~# cp -r /tmp/EFIMNT/* /tmp/EFIBKP/
root@mint:~# umount /dev/sda1
root@mint:~# file -s /dev/sda1
/dev/sda1: DOS/MBR boot sector, code offset 0x58+2, OEM-ID "MSDOS5.0", sectors/cluster 4, reserved sectors 2, root entries 512, Media descriptor 0xf8, sectors/FAT 200, sectors/track 32, heads 64, sectors 204800 (volumes > 32 MB), serial number 0x558938bd, label: "BOOT ", FAT (16 bit)
root@mint:~# mkfs -t vfat -n BOOT /dev/sda1
mkfs.fat 4.1 (2017-01-24)
root@mint:~#
…then check the new filesystem and put back the original boot files:
root@mint:~# file -s /dev/sda1
/dev/sda1: DOS/MBR boot sector, code offset 0x3c+2, OEM-ID "mkfs.fat", sectors/cluster 4, reserved sectors 4, root entries 512, Media descriptor 0xf8, sectors/FAT 200, sectors/track 32, heads 64, hidden sectors 64, sectors 204800 (volumes > 32 MB), serial number 0x31d4d250, label: "BOOT ", FAT (16 bit)
root@mint:~# mount /dev/sda1 /tmp/EFIMNT/
root@mint:~# cp -r /tmp/EFIBKP/* /tmp/EFIMNT/
root@mint:~# umount /dev/sda1
Finally, the machine booted. I retried the whole process a few times. The above was the only time when the FreeBSD fsck returned with an unclean filesystem. So, I am not entirely sure if vSphere 8 has something going on, or if it is the fact that my Dell Optiplex is so old. Nevertheless, vSphere 8 was successfully installed.
Error loading /vsan.v00
"Fatal error: 10 (Out of resources)"
A quick Google search revealed nothing useful, really. This article from VMware might be helpful to some. Unfortunately, the Optiplex has no such setting. In the end, switching from “Legacy boot” to “UEFI” resolved the issue.
This other post might be worth checking out as well.
First, you need to enable Workload Identity on your Kubernetes cluster:
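This can be done from the console or with gcloud. A sketch, assuming a cluster named my-cluster in zone us-central1-a (both placeholders); the second command enables the GKE metadata server on an existing node pool:
[somedude@k2 ~]$ gcloud container clusters update my-cluster --zone=us-central1-a --workload-pool=somedudegproject.svc.id.goog
[somedude@k2 ~]$ gcloud container node-pools update default-pool --cluster=my-cluster --zone=us-central1-a --workload-metadata=GKE_METADATA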
After that, you need to create a Kubernetes namespace and a Kubernetes service account called ksa-sql-workload:
[somedude@k2 ~]$ kubectl create namespace myknamespace
[somedude@k2 ~]$ kubectl create serviceaccount ksa-sql-workload --namespace myknamespace
Following that, create the GCP service account gsa-sql-workload.
[somedude@k2 ~]$ gcloud iam service-accounts create gsa-sql-workload --project=somedudegproject
Add a role to the GCP service account. 123456789012 is the project ID of your GCP project. Here, the gsa-sql-workload service account is assigned the cloudsql.client role:
[somedude@k2 ~]$ gcloud projects add-iam-policy-binding 123456789012 --member "serviceAccount:gsa-sql-workload@somedudegproject.iam.gserviceaccount.com" --role "roles/cloudsql.client"
Next, you bind the GCP service account with the Kubernetes service account, so that the Kubernetes service account gets the privileges of the GCP service account. Sheesh…
[somedude@k2 ~]$ gcloud iam service-accounts add-iam-policy-binding --role "roles/iam.workloadIdentityUser" --member "serviceAccount:somedudegproject.svc.id.goog[myknamespace/ksa-sql-workload]" gsa-sql-workload@somedudegproject.iam.gserviceaccount.com
Updated IAM policy for serviceAccount [gsa-sql-workload@somedudegproject.iam.gserviceaccount.com].
bindings:
- members:
- serviceAccount:somedudegproject.svc.id.goog[myknamespace/ksa-sql-workload]
role: roles/iam.workloadIdentityUser
etag: A1234567890=
version: 1
Finally, annotate the service account:
[somedude@k2 ~]$ kubectl annotate serviceaccount ksa-sql-workload --namespace myknamespace iam.gke.io/gcp-service-account=gsa-sql-workload@somedudegproject.iam.gserviceaccount.com
serviceaccount/ksa-sql-workload annotated
If you enabled workload identity on the cluster node pool, you can use a nodeSelector in the deployment to make sure your containers land on workload-identity-enabled nodes.
template:
  metadata:
    labels:
      app: ...
  spec:
    ...
    affinity:
      ...
    tolerations:
      ...
    ...
    serviceAccountName: ksa-sql-workload
    nodeSelector:
      iam.gke.io/gke-metadata-server-enabled: "true"
    containers:
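Once a pod is running under this service account, the identity can be verified from inside the pod. Querying the metadata server should return the GCP service account e-mail (assuming curl is available in the container image):
curl -H "Metadata-Flavor: Google" http://metadata.google.internal/computeMetadata/v1/instance/service-accounts/default/email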
Start by configuring trunking on the vCenter port group. Depending on what you need, you can trunk all VLANs towards the VM (not wise), or trunk just specific VLAN ranges:
You might need to set the policies as follows:
Next, reconfigure the NIC on the Rocky VM to work as a trunk:
[root@docker01 network-scripts]# more ifcfg-ens224
NAME=ens224
DEVICE=ens224
ONBOOT=yes
NETBOOT="yes"
TYPE=Ethernet
Then, configure the VLAN 50 interface for management…
[root@docker01 network-scripts]# cat ifcfg-ens224.50
VLAN=yes
TYPE=Vlan
PHYSDEV=ens224
VLAN_ID=50
REORDER_HDR=yes
GVRP=no
MVRP=no
HWADDR=
IPADDR=192.168.50.10
NETMASK=255.255.255.0
GATEWAY=192.168.50.254
DNS1=192.168.50.100
PROXY_METHOD=none
BROWSER_ONLY=no
BOOTPROTO=none
DEFROUTE=yes
IPV4_FAILURE_FATAL=no
IPV6INIT=yes
IPV6_AUTOCONF=yes
IPV6_DEFROUTE=yes
IPV6_FAILURE_FATAL=no
IPV6_ADDR_GEN_MODE=stable-privacy
NAME=ens224.50
UUID=bdfdd998-7359-4b5f-8ae7-7e3b7786bb22
DEVICE=ens224.50
ONBOOT=yes
PREFIX=24
RES_OPTIONS="rotate timeout:1 retries:1"
…and the VLAN 701 interface:
[root@docker01 network-scripts]# cat ifcfg-ens224.701
VLAN=yes
TYPE=Vlan
PHYSDEV=ens224
VLAN_ID=701
REORDER_HDR=yes
GVRP=no
MVRP=no
IPADDR=192.168.70.29
NETMASK=255.255.255.240
GATEWAY=192.168.70.30
DNS1=192.168.70.1
PROXY_METHOD=none
BROWSER_ONLY=no
BOOTPROTO=none
DEFROUTE=no
IPV4_FAILURE_FATAL=no
IPV6INIT=no
IPV6_AUTOCONF=no
IPV6_DEFROUTE=no
IPV6_FAILURE_FATAL=no
IPV6_ADDR_GEN_MODE=stable-privacy
NAME=ens224.701
UUID=0809f5a6-1b2a-4a37-b664-fa9c46634c92
DEVICE=ens224.701
ONBOOT=yes
PREFIX=28
RES_OPTIONS="rotate timeout:1 retries:1"
The same configuration should be performed for the VLAN 60 interface, adjusting settings as needed.
At this point there should be three functioning interfaces: ens224.50, ens224.60 and ens224.701. Next, you need to configure symmetric routing. Rather than taking a stab at an explanation, here is a pretty good one instead.
So, in my case the following rule-* and route-* files are needed for VLAN 50 and VLAN 701. You need to perform similar configuration for any other VLANs you will need.
The following is the routing for VLAN 50. Note the default entry; this is the management interface:
[root@docker01 network-scripts]# cat route-ens224.50
192.168.50.0/24 dev ens224.50 src 192.168.50.10 table rt50
default via 192.168.50.254 table rt50
Next, I specify the rule under which the above routes will be utilized:
[root@docker01 network-scripts]# cat rule-ens224.50
from 192.168.50.10 prio 50 table rt50
Similarly, the following files deal with VLAN 701. Note the absence of a default entry:
[root@docker01 network-scripts]# cat rule-ens224.701
from 192.168.70.29 prio 70 table rt701
[root@docker01 network-scripts]# cat route-ens224.701
192.168.70.16/28 dev ens224.701 src 192.168.70.29 table rt701
Now, I need to make sure the alternate routing tables are defined in rt_tables. This is simply a mapping file that says “this number maps to this friendly name”.
[root@docker01 network-scripts]# cat /etc/iproute2/rt_tables
#
# reserved values
#
255 local
254 main
253 default
0 unspec
#
# local
#
#1 inr.ruhep
50 rt50
60 rt60
70 rt701
There are more details on the content of the two files here.
Like me, you might need to install the NetworkManager-dispatcher-routing-rules package, which allows NetworkManager to process the route-* and rule-* files.
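Once the interfaces are bounced, the policy routing can be sanity-checked. ip rule show should list the two rules above, and the per-table routes can be inspected as well:
[root@docker01 network-scripts]# ip rule show
[root@docker01 network-scripts]# ip route show table rt50
[root@docker01 network-scripts]# ip route show table rt701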
Finally, below is a snippet from docker-compose.yml. Note the IP address assignment for the VLAN 701 interface and its definition towards the bottom, utilizing the macvlan driver.
version: "3"
services:
nginx:
container_name: nginx
image: nginx:mainline-alpine
restart: always
ports:
- 80:80
networks:
db:
vlan701:
ipv4_address: 192.168.70.28
...
networks:
vlan701:
name: VLAN701 dmz-app to expose external apps
driver: macvlan
driver_opts:
parent: ens224.701
ipam:
config:
- subnet: 192.168.70.16/28
gateway: 192.168.70.30
Depending on the environment, you might need to fiddle with rp_filter; see more info below:
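For example, with macvlan setups like this one, loose reverse path filtering on the VLAN interface is sometimes needed. A sketch; note the slash syntax, required because the interface name itself contains a dot:
[root@docker01 ~]# sysctl -w net/ipv4/conf/ens224.701/rp_filter=2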
The file descriptors are automatically numbered 0, 1 and 2: as in stdin, stdout and stderr. stdin accepts input. The input can be from the terminal or output from another program. stdout “produces” output from the command. This can be echoed back to the screen or used as an input stream for another program. stderr is meant for errors emitted from the program. Those, of course, are predefined uses. There is nothing stopping you from using stderr as stdout.
[admin@build01 ~]$ ps -ef | grep bash
admin 2527817 2527816 0 23:34 pts/0 00:00:00 -bash
[admin@build01 ~]$ cd /proc/2527817/fd
[admin@build01 fd]$ ls -l
total 0
lrwx------. 1 admin admin 64 Nov 30 23:42 0 -> /dev/pts/0
lrwx------. 1 admin admin 64 Nov 30 23:42 1 -> /dev/pts/0
lrwx------. 1 admin admin 64 Nov 30 23:42 2 -> /dev/pts/0
lrwx------. 1 admin admin 64 Nov 30 23:42 255 -> /dev/pts/0
lr-x------. 1 admin admin 64 Nov 30 23:42 3 -> /var/lib/sss/mc/passwd
You might notice file descriptors 3 and 255. Here and here is more info on what those are.
Let’s cat a file:
[admin@build01 ~]$ cat file.txt
This is a text file.
The file is displayed via stdout. We can supply file.txt as input to the cat command and redirect the output to newfile.txt:
[admin@build01 ~]$ cat < file.txt > newfile.txt
[admin@build01 ~]$ cat newfile.txt
This is a text file.
Let’s say there is a nonexistent file nofile.txt:
[admin@build01 ~]$ cat nofile.txt
-bash: nofile.txt: No such file or directory
The error was displayed via stderr. Let’s assume we want to capture the errors in a separate file:
[admin@build01 ~]$ cat nofile.txt 2> error.txt
[admin@build01 ~]$ cat error.txt
cat: nofile.txt: No such file or directory
The 2 means to redirect stderr to error.txt. To redirect the stdout of a command, one would do something like somecommand > output.txt. That is the same as:
[admin@build01 ~]$ cat file.txt 1> redirect.txt
[admin@build01 ~]$ cat redirect.txt
This is a text file.
Notice the 1. Sometimes you do not want to see error messages on the screen, so you can redirect stderr to /dev/null:
[admin@build01 ~]$ cat nofile.txt 2> /dev/null
[admin@build01 ~]$
The following is sometimes used in cron jobs. A script is run that has some output going to stdout, but in between you might have some messages emitted to stderr. You might not want to be notified by cron about the output, so you can redirect stderr to stdout and send it all to /dev/null:
[admin@build01 ~]$ cat nofile.txt > /dev/null 2>&1
[admin@build01 ~]$
2>&1 means redirect stderr to stdout. So, for example, if you had 3>&2, that would mean to redirect file descriptor 3 to stderr. If you omitted the &, then the redirect would go to a file called 2. The & sign can be thought of as “file descriptor”.
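You can also open file descriptors of your own using exec. A small illustration (three.txt is just an arbitrary file name):
[admin@build01 ~]$ exec 3> three.txt
[admin@build01 ~]$ echo "written via fd 3" >&3
[admin@build01 ~]$ exec 3>&-
[admin@build01 ~]$ cat three.txt
written via fd 3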
Advanced Bash-Scripting Guide has some good info, too.
By the way, when I was figuring this out I came across this page. The person was doing exactly the same thing, which made my life quite easy. I did end up stealing his HAproxy check command - credit goes out to that person.
Make sure the HAproxy version used is 1.8 or higher. Versions prior to 1.8 lack external-check, which is used here to determine the current PostgreSQL master. CentOS 7 ships quite an old version of HAproxy.
On the master server, create a PostgreSQL user, haproxy, to be used by HAproxy for monitoring. This user will be propagated to the replica.
# su - postgres
-bash-4.2$ psql
psql (13.3)
Type "help" for help.
postgres=# CREATE USER haproxy WITH PASSWORD 'haproxy';
CREATE ROLE
postgres=# quit
-bash-4.2$
On both the master and the replica server, edit pg_hba.conf and add an entry for the PostgreSQL haproxy user that will be connecting from the HAproxy server, allowing it to access the postgres database. The IP address is the address of the HAproxy server.
host postgres haproxy 192.168.100.100/32 scram-sha-256
Do not forget to restart PostgreSQL. To configure HAproxy, add the following parameters to haproxy.cfg:
global
    # The following are needed to use the external-check command in the backend section below
    insecure-fork-wanted
    external-check

backend pgsql
    mode tcp
    external-check command /var/lib/haproxy/checkpg.sh
    option external-check
    server masterdb.unixpowered.com 10.10.10.10:5432 check inter 1s
    server replicadb.unixpowered.com 10.10.10.20:5432 check inter 1s
Above, the checkpg.sh script is what figures out which PostgreSQL server is the primary. The script looks at pg_is_in_recovery(). If true is returned, then the server is in recovery, i.e. a standby. Based on this value, HAproxy can determine where to send database traffic.
#!/bin/bash
# These are variables that facilitate the connection to PostgreSQL to check pg_is_in_recovery()
#
_PG_USER=haproxy
_PG_PASS=haproxy
_PG_DB=postgres
_PG_BIN=/usr/pgsql-13/bin/psql
#
# These are the HAproxy virtual IP, port and real IP. These are passed as parameters to the check script.
# See https://web.archive.org/web/20211012185217/https://www.loadbalancer.org/blog/how-to-write-an-external-custom-healthcheck-for-haproxy/
_VIRT_IP=$1
_VIRT_PORT=$2
_REAL_IP=$3

if [ "$4" == "" ]; then
    _REAL_PORT=$_VIRT_PORT
else
    _REAL_PORT=$4
fi

STATUS=$(PGPASSWORD="$_PG_PASS" $_PG_BIN -qtAX -c "select pg_is_in_recovery()" -h "$_REAL_IP" -p "$_REAL_PORT" --dbname="$_PG_DB" --username="$_PG_USER")

if [ "$STATUS" == "f" ]; then
    # We are in master mode
    exit 0
else
    exit 1
fi
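The query itself can be sanity-checked from the HAproxy server with psql; on the master it should print f, on the replica t (assuming the pg_hba.conf entries above are in place):
# PGPASSWORD=haproxy /usr/pgsql-13/bin/psql -qtAX -h 10.10.10.10 -p 5432 --dbname=postgres --username=haproxy -c "select pg_is_in_recovery()"
f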