Codebook learning for active sensing
A no-math intuitive account of my conference publication in SPAWC 2024.
Conference paper: IEEE
I presented this paper at SPAWC in Lucca, Italy (poster at the bottom). This publication further investigates the codebook design problem for active sensing (background on active sensing can be found here.
In this paper, I design a beamforming codebook for a reconfigurable intelligent surface (RIS) based active sensing scheme for uplink localization, in which the mobile user sends a sequence of pilots to the base station (BS), and the RIS is adaptively configured by carefully choosing RIS codewords from a codebook in a sequential manner to progressively focus on the target.
Most existing codebook designs for RIS are not tailored for active sensing where the choice of next codeword depends on the measurement available so far, and are not designed to dynamically focus reflection toward the target. Moreover, most existing codeword selection methods rely on exhaustive search in beam training to identify the codeword with the highest signal-to-noise ratio (SNR), thus incurring substantial pilot overhead as the codebook scales.
Here, I propose a learning-based approach for codebook construction and codeword selection for active sensing. The proposed learning approach can locate a target in the service area by recursively selecting a sequence of codewords from the codebook as more measurements become available without exhaustive beam training. The codebook design and the codeword selection fuse key ideas from vector quantized-variational autoencoder (VQVAE) and long short-term memory (LSTM) network to learn respectively the discrete function space of the codebook and the temporal dependencies between measurements.

The neural network architecture is shown above. The idea is that an LSTM network can be used to process the temporal input (pilots). At each sensing stage, the LSTM cell receives a new measurement and maps the measurement (along with historical measurements) to the sensing vectors in the next stage. The sensing vectors are then mapped to a codeword via vector quantization (VQ). However, the VQ operation lacks a gradient, which renders backward propagation ineffective in updating the LSTM network during the backward propagation. To address this issue, we use the idea of gradient approximation to estimate the unknown gradient to allow for weight updates. Finally, by concatenating a chain of LSTM cells, a sequence of sensing codewords can be selected and optimized. Through many training samples, all the codewords in the codebook is selected and optimized.
Poster:
