-
Hybrid Receiver Design for Massive MIMO-OFDM with Low-Resolution ADCs and Oversampling
Authors:
Mengyuan Ma,
Nhan Thanh Nguyen,
Italo Atzeni,
Markku Juntti
Abstract:
Low-resolution analog-to-digital converters (ADCs) and hybrid beamforming have emerged as efficient solutions to reduce power consumption with satisfactory spectral efficiency (SE) in massive multiple-input multiple-output (MIMO) systems. In this paper, we investigate the performance of a hybrid receiver in uplink massive MIMO orthogonal frequency-division multiplexing (OFDM) systems with low-reso…
▽ More
Low-resolution analog-to-digital converters (ADCs) and hybrid beamforming have emerged as efficient solutions to reduce power consumption with satisfactory spectral efficiency (SE) in massive multiple-input multiple-output (MIMO) systems. In this paper, we investigate the performance of a hybrid receiver in uplink massive MIMO orthogonal frequency-division multiplexing (OFDM) systems with low-resolution ADCs and oversampling. Considering both the temporal and spatial correlation of the quantization distortion (QD), we derive a closed-form approximation of the frequency-domain QD covariance matrix, which facilitates the evaluation of the system SE. Then we jointly design the analog and baseband combiners to maximize the SE. The formulated problem is significantly challenging due to the constant-modulus constraint of the analog combiner and its coupling with the digital one. To overcome the challenges, we transform the objective function into an equivalent but more tractable form and then iteratively update the analog and digital combiner. Numerical simulations verify the superiority of the proposed algorithm compared to the considered benchmarks and show the resilience of the hybrid receiver to beam squint for low-resolution systems. Furthermore, the results show that the proposed hybrid receiver design with oversampling can achieve significantly higher energy efficiency compared to the digital one.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Joint Beamforming Design and Bit Allocation in Massive MIMO with Resolution-Adaptive ADCs
Authors:
Mengyuan Ma,
Nhan Thanh Nguyen,
Italo Atzeni,
Markku Juntti
Abstract:
Low-resolution analog-to-digital converters (ADCs) have emerged as a promising technology for reducing power consumption and complexity in massive multiple-input multiple-output (MIMO) systems while maintaining satisfactory spectral and energy efficiencies (SE/EE). In this work, we first identify the essential properties of optimal quantization and leverage them to derive a closed-form approximati…
▽ More
Low-resolution analog-to-digital converters (ADCs) have emerged as a promising technology for reducing power consumption and complexity in massive multiple-input multiple-output (MIMO) systems while maintaining satisfactory spectral and energy efficiencies (SE/EE). In this work, we first identify the essential properties of optimal quantization and leverage them to derive a closed-form approximation of the covariance matrix of the quantization distortion. The theoretical finding facilitates the system SE analysis in the presence of low-resolution ADCs. We then focus on the joint optimization of the transmit-receive beamforming and bit allocation to maximize the SE under constraints on the transmit power and the total number of active ADC bits. To solve the resulting mixed-integer problem, we first develop an efficient beamforming design for fixed ADC resolutions. Then, we propose a low-complexity heuristic algorithm to iteratively optimize the ADC resolutions and beamforming matrices. Numerical results for a $64 \times 64$ MIMO system demonstrate that the proposed design offers $6\%$ improvement in both SE and EE with $40\%$ fewer active ADC bits compared with the uniform bit allocation. Furthermore, we numerically show that receiving more data streams with low-resolution ADCs can achieve higher SE and EE compared to receiving fewer data streams with high-resolution ADCs.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Joint Communications and Sensing Hybrid Beamforming Design via Deep Unfolding
Authors:
Nhan Thanh Nguyen,
Ly V. Nguyen,
Nir Shlezinger,
Yonina C. Eldar,
A. Lee Swindlehurst,
Markku Juntti
Abstract:
Joint communications and sensing (JCAS) is envisioned as a key feature in future wireless communications networks. In massive MIMO-JCAS systems, hybrid beamforming (HBF) is typically employed to achieve satisfactory beamforming gains with reasonable hardware cost and power consumption. Due to the coupling of the analog and digital precoders in HBF and the dual objective in JCAS, JCAS-HBF design pr…
▽ More
Joint communications and sensing (JCAS) is envisioned as a key feature in future wireless communications networks. In massive MIMO-JCAS systems, hybrid beamforming (HBF) is typically employed to achieve satisfactory beamforming gains with reasonable hardware cost and power consumption. Due to the coupling of the analog and digital precoders in HBF and the dual objective in JCAS, JCAS-HBF design problems are very challenging and usually require highly complex algorithms. In this paper, we propose a fast HBF design for JCAS based on deep unfolding to optimize a tradeoff between the communications rate and sensing accuracy. We first derive closed-form expressions for the gradients of the communications and sensing objectives with respect to the precoders and demonstrate that the magnitudes of the gradients pertaining to the analog precoder are typically smaller than those associated with the digital precoder. Based on this observation, we propose a modified projected gradient ascent (PGA) method with significantly improved convergence. We then develop a deep unfolded PGA scheme that efficiently optimizes the communications-sensing performance tradeoff with fast convergence thanks to the well-trained hyperparameters. In doing so, we preserve the interpretability and flexibility of the optimizer while leveraging data to improve performance. Finally, our simulations demonstrate the potential of the proposed deep unfolded method, which achieves up to 33.5% higher communications sum rate and 2.5 dB lower beampattern error compared with the conventional design based on successive convex approximation and Riemannian manifold optimization. Furthermore, it attains up to a 65% reduction in run time and computational complexity with respect to the PGA procedure without unfolding.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Analysis of Oversampling in Uplink Massive MIMO-OFDM with Low-Resolution ADCs
Authors:
Mengyuan Ma,
Nhan Thanh Nguyen,
Italo Atzeni,
Markku Juntti
Abstract:
Low-resolution analog-to-digital converters (ADCs) have emerged as an efficient solution for massive multiple-input multiple-output (MIMO) systems to reap high data rates with reasonable power consumption and hardware complexity. In this paper, we analyze the performance of oversampling in uplink massive MIMO orthogonal frequency-division multiplexing (MIMO-OFDM) systems with low-resolution ADCs.…
▽ More
Low-resolution analog-to-digital converters (ADCs) have emerged as an efficient solution for massive multiple-input multiple-output (MIMO) systems to reap high data rates with reasonable power consumption and hardware complexity. In this paper, we analyze the performance of oversampling in uplink massive MIMO orthogonal frequency-division multiplexing (MIMO-OFDM) systems with low-resolution ADCs. Considering both the temporal and spatial correlation of the quantization distortion, we derive an approximate closed-form expression of an achievable sum rate, which reveals how the oversampling ratio (OSR), the ADC resolution, and the signal-to-noise ratio (SNR) jointly affect the system performance. In particular, we demonstrate that oversampling can effectively improve the sum rate by mitigating the impact of the quantization distortion, especially at high SNR and with very low ADC resolution. Furthermore, we show that the considered low-resolution massive MIMO-OFDM system can achieve the same performance as the unquantized one when both the SNR and the OSR are sufficiently high. Numerical simulations confirm our analysis.
△ Less
Submitted 30 June, 2023;
originally announced June 2023.
-
Deep Unfolding Enabled Constant Modulus Waveform Design for Joint Communications and Sensing
Authors:
Prashanth Krishnananthalingam,
Nhan Thanh Nguyen,
Markku Juntti
Abstract:
Joint communications and sensing (JCAS) systems have recently emerged as a promising technology to utilize the scarce spectrum in wireless networks and to reuse the same hardware to save infrastructure costs. In practical JCAS systems, dual functional constant-modulus waveforms can be employed to avoid signal distortion in nonlinear power amplifiers. However, the designs of such waveforms are very…
▽ More
Joint communications and sensing (JCAS) systems have recently emerged as a promising technology to utilize the scarce spectrum in wireless networks and to reuse the same hardware to save infrastructure costs. In practical JCAS systems, dual functional constant-modulus waveforms can be employed to avoid signal distortion in nonlinear power amplifiers. However, the designs of such waveforms are very challenging due to the nonconvex constant-modulus constraint. The conventional branch-and-bound (BnB) method can achieve optimal solution but at the cost of exponential complexity and long run time. In this paper, we propose an efficient deep unfolding method for the constant-modulus waveform design in a multiuser multiple-input multiple-output (MIMO) JCAS system. The deep unfolding model has a sparsely-connected structure and is trained in an unsupervised fashion. It achieves good communications-sensing performance tradeoff while maintaining low computational complexity and low run time. Specifically, our numerical results show that the proposed deep unfolding scheme achieves a similar achievable rate compared to the conventional BnB method with 30 times faster execution time.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Fairness Enhancement of UAV Systems with Hybrid Active-Passive RIS
Authors:
Nhan Thanh Nguyen,
Van-Dinh Nguyen,
Hieu Van Nguyen,
Qingqing Wu,
Antti Tolli,
Symeon Chatzinotas,
Markku Juntti
Abstract:
We consider unmanned aerial vehicle (UAV)-enabled wireless systems where downlink communications between a multi-antenna UAV and multiple users are assisted by a hybrid active-passive reconfigurable intelligent surface (RIS). We aim at a fairness design of two typical UAV-enabled networks, namely the static-UAV network where the UAV is deployed at a fixed location to serve all users at the same ti…
▽ More
We consider unmanned aerial vehicle (UAV)-enabled wireless systems where downlink communications between a multi-antenna UAV and multiple users are assisted by a hybrid active-passive reconfigurable intelligent surface (RIS). We aim at a fairness design of two typical UAV-enabled networks, namely the static-UAV network where the UAV is deployed at a fixed location to serve all users at the same time, and the mobile-UAV network which employs the time division multiple access protocol. In both networks, our goal is to maximize the minimum rate among users through jointly optimizing the UAV's location/trajectory, transmit beamformer, and RIS coefficients. The resulting problems are highly nonconvex due to a strong coupling between the involved variables. We develop efficient algorithms based on block coordinate ascend and successive convex approximation to effectively solve these problems in an iterative manner. In particular, in the optimization of the mobile-UAV network, closed-form solutions to the transmit beamformer and RIS passive coefficients are derived. Numerical results show that a hybrid RIS equipped with only 4 active elements and a power budget of 0 dBm offers an improvement of 38%-63% in minimum rate, while that achieved by a passive RIS is only about 15%, with the same total number of elements.
△ Less
Submitted 20 September, 2023; v1 submitted 24 June, 2023;
originally announced June 2023.
-
Joint Communications and Sensing Design for Multi-Carrier MIMO Systems
Authors:
Nhan Thanh Nguyen,
Nir Shlezinger,
Khac-Hoang Ngo,
Van-Dinh Nguyen,
Markku Juntti
Abstract:
In conventional joint communications and sensing (JCAS) designs for multi-carrier multiple-input multiple-output (MIMO) systems, the dual-functional waveforms are often optimized for the whole frequency band, resulting in limited communications--sensing performance tradeoff. To overcome the limitation, we propose employing a subset of subcarriers for JCAS, while the communications function is perf…
▽ More
In conventional joint communications and sensing (JCAS) designs for multi-carrier multiple-input multiple-output (MIMO) systems, the dual-functional waveforms are often optimized for the whole frequency band, resulting in limited communications--sensing performance tradeoff. To overcome the limitation, we propose employing a subset of subcarriers for JCAS, while the communications function is performed over all the subcarriers. This offers more degrees of freedom to enhance the communications performance under a given sensing accuracy. We first formulate the rate maximization under the sensing accuracy constraint to optimize the beamformers and JCAS subcarriers. The problem is solved via Riemannian manifold optimization and closed-form solutions. Numerical results for an 8x4 MIMO system with 64 subcarriers show that compared to the conventional subcarrier sharing scheme, the proposed scheme employing 16 JCAS subcarriers offers 60% improvement in the achievable communications rate at the signal-to-noise ratio of 10 dB. Meanwhile, this scheme generates the sensing beampattern with the same quality as the conventional JCAS design.
△ Less
Submitted 24 June, 2023;
originally announced June 2023.
-
Beam Squint Analysis and Mitigation via Hybrid Beamforming Design in THz Communications
Authors:
Mengyuan Ma,
Nhan Thanh Nguyen,
Markku Juntti
Abstract:
We investigate the beam squint effect in uniform planar arrays (UPAs) and propose an efficient hybrid beamforming (HBF) design to mitigate the beam squint in multiple-input multiple-output orthogonal frequency-division multiplexing (MIMO-OFDM) systems operating at terahertz band. We first analyze the array gain and derive the closed-form beam squint ratio that characterizes the severity of the bea…
▽ More
We investigate the beam squint effect in uniform planar arrays (UPAs) and propose an efficient hybrid beamforming (HBF) design to mitigate the beam squint in multiple-input multiple-output orthogonal frequency-division multiplexing (MIMO-OFDM) systems operating at terahertz band. We first analyze the array gain and derive the closed-form beam squint ratio that characterizes the severity of the beam squint effect on UPAs. The effect is shown to be more severe with a higher fractional bandwidth, while it can be significantly mitigated when the shape of a UPA approaches a square. We then focus on the HBF design that maximizes the system spectral efficiency. The design problem is challenging due to the frequency-flat nature and hardware constraints of the analog beamformer. We overcome the challenges by proposing an efficient decoupling design in which the digital and analog beamformers admit closed-form solutions, which facilitate practical implementations. Numerical results validate our analysis and show that the proposed HBF design is robust to beam squint, and thus, it outperforms the state-of-the-art methods in wideband massive MIMO systems.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
AI-Empowered Hybrid MIMO Beamforming
Authors:
Nir Shlezinger,
Mengyuan Ma,
Ortal Lavi,
Nhan Thanh Nguyen,
Yonina C. Eldar,
Markku Juntti
Abstract:
Hybrid multiple-input multiple-output (MIMO) is an attractive technology for realizing extreme massive MIMO systems envisioned for future wireless communications in a scalable and power-efficient manner. However, the fact that hybrid MIMO systems implement part of their beamforming in analog and part in digital makes the optimization of their beampattern notably more challenging compared with conv…
▽ More
Hybrid multiple-input multiple-output (MIMO) is an attractive technology for realizing extreme massive MIMO systems envisioned for future wireless communications in a scalable and power-efficient manner. However, the fact that hybrid MIMO systems implement part of their beamforming in analog and part in digital makes the optimization of their beampattern notably more challenging compared with conventional fully digital MIMO. Consequently, recent years have witnessed a growing interest in using data-aided artificial intelligence (AI) tools for hybrid beamforming design. This article reviews candidate strategies to leverage data to improve real-time hybrid beamforming design. We discuss the architectural constraints and characterize the core challenges associated with hybrid beamforming optimization. We then present how these challenges are treated via conventional optimization, and identify different AI-aided design approaches. These can be roughly divided into purely data-driven deep learning models and different forms of deep unfolding techniques for combining AI with classical optimization.We provide a systematic comparative study between existing approaches including both numerical evaluations and qualitative measures. We conclude by presenting future research opportunities associated with the incorporation of AI in hybrid MIMO systems.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Deep Unfolding Hybrid Beamforming Designs for THz Massive MIMO Systems
Authors:
Nhan Thanh Nguyen,
Mengyuan Ma,
Nir Shlezinger,
Yonina C. Eldar,
A. L. Swindlehurst,
Markku Juntti
Abstract:
Hybrid beamforming (HBF) is a key enabler for wideband terahertz (THz) massive multiple-input multiple-output (mMIMO) communications systems. A core challenge with designing HBF systems stems from the fact their application often involves a non-convex, highly complex optimization of large dimensions. In this paper, we propose HBF schemes that leverage data to enable efficient designs for both the…
▽ More
Hybrid beamforming (HBF) is a key enabler for wideband terahertz (THz) massive multiple-input multiple-output (mMIMO) communications systems. A core challenge with designing HBF systems stems from the fact their application often involves a non-convex, highly complex optimization of large dimensions. In this paper, we propose HBF schemes that leverage data to enable efficient designs for both the fully-connected HBF (FC-HBF) and dynamic sub-connected HBF (SC-HBF) architectures. We develop a deep unfolding framework based on factorizing the optimal fully digital beamformer into analog and digital terms and formulating two corresponding equivalent least squares (LS) problems. Then, the digital beamformer is obtained via a closed-form LS solution, while the analog beamformer is obtained via ManNet, a lightweight sparsely-connected deep neural network based on unfolding projected gradient descent. Incorporating ManNet into the developed deep unfolding framework leads to the ManNet-based FC-HBF scheme. We show that the proposed ManNet can also be applied to SC-HBF designs after determining the connections between the radio frequency chain and antennas. We further develop a simplified version of ManNet, referred to as subManNet, that directly produces the sparse analog precoder for SC-HBF architectures. Both networks are trained with an unsupervised training procedure. Numerical results verify that the proposed ManNet/subManNet-based HBF approaches outperform the conventional model-based and deep unfolded counterparts with very low complexity and a fast run time. For example, in a simulation with 128 transmit antennas, it attains a slightly higher spectral efficiency than the Riemannian manifold scheme, but over 1000 times faster and with a complexity reduction of more than by a factor of six (6).
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Network-Aided Intelligent Traffic Steering in 6G O-RAN: A Multi-Layer Optimization Framework
Authors:
Van-Dinh Nguyen,
Thang X. Vu,
Nhan Thanh Nguyen,
Dinh C. Nguyen,
Markku Juntti,
Nguyen Cong Luong,
Dinh Thai Hoang,
Diep N. Nguyen,
Symeon Chatzinotas
Abstract:
To enable an intelligent, programmable and multi-vendor radio access network (RAN) for 6G networks, considerable efforts have been made in standardization and development of open RAN (O-RAN). So far, however, the applicability of O-RAN in controlling and optimizing RAN functions has not been widely investigated. In this paper, we jointly optimize the flow-split distribution, congestion control and…
▽ More
To enable an intelligent, programmable and multi-vendor radio access network (RAN) for 6G networks, considerable efforts have been made in standardization and development of open RAN (O-RAN). So far, however, the applicability of O-RAN in controlling and optimizing RAN functions has not been widely investigated. In this paper, we jointly optimize the flow-split distribution, congestion control and scheduling (JFCS) to enable an intelligent traffic steering application in O-RAN. Combining tools from network utility maximization and stochastic optimization, we introduce a multi-layer optimization framework that provides fast convergence, long-term utility-optimality and significant delay reduction compared to the state-of-the-art and baseline RAN approaches. Our main contributions are three-fold: i) we propose the novel JFCS framework to efficiently and adaptively direct traffic to appropriate radio units; ii) we develop low-complexity algorithms based on the reinforcement learning, inner approximation and bisection search methods to effectively solve the JFCS problem in different time scales; and iii) the rigorous theoretical performance results are analyzed to show that there exists a scaling factor to improve the tradeoff between delay and utility-optimization. Collectively, the insights in this work will open the door towards fully automated networks with enhanced control and flexibility. Numerical results are provided to demonstrate the effectiveness of the proposed algorithms in terms of the convergence rate, long-term utility-optimality and delay reduction.
△ Less
Submitted 29 May, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Switch-based Hybrid Beamforming Transceiver Design for Wideband Communications with Beam Squint
Authors:
Mengyuan Ma,
Nhan Thanh Nguyen,
Markku Juntti
Abstract:
Hybrid beamforming (HBF) transceiver architectures based on frequency-independent phase shifters (PS-HBF) are sensitive to the phases and physical directions with limited capability to compensate for the detrimental effects of the beam squint. Motivated by the fact that switches are phase-independent and more power/cost efficient than PSs, we consider the switch-based HBF (SW-HBF) for wideband lar…
▽ More
Hybrid beamforming (HBF) transceiver architectures based on frequency-independent phase shifters (PS-HBF) are sensitive to the phases and physical directions with limited capability to compensate for the detrimental effects of the beam squint. Motivated by the fact that switches are phase-independent and more power/cost efficient than PSs, we consider the switch-based HBF (SW-HBF) for wideband large-scale multiple-input multiple-output systems in this paper. We first derive a closed-form expression of the beam squint ratio and compare the expected array gains of both SW-HBF and PS-HBF architectures. The results show that SW-HBF is more robust to the beam squint effect. We then focus on the SW-HBF designs to maximize the spectral efficiency (SE) in both single-user and multiuser systems, which are both non-convex mixed-integer problems. For the former, by combining the tabu search (TS) method and projected gradient ascend (PGA), we propose an efficient heuristic PGA-TS algorithm to design analog beamformers while the digital ones admit closed-form solutions. For the latter, we develop a two-step algorithm based on fractional programming and the PGA-TS method. Simulations show that the proposed SW-HBF schemes are efficient and can outperform PS-based HBF architectures in terms of both SE and energy efficiency in terahertz communication systems.
△ Less
Submitted 20 November, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Robust tube-based LPV-MPC for autonomous lane keeping
Authors:
Maryam Nezami,
Hossam Seddik Abbas,
Ngoc Thinh Nguyen,
Georg Schildbach
Abstract:
This paper proposes a control architecture for autonomous lane keeping by a vehicle. In this paper, the vehicle dynamics consist of two parts: lateral and longitudinal dynamics. Therefore, the control architecture comprises two subsequent controllers. A longitudinal model predictive control (MPC) makes the vehicle track the desired longitudinal speeds that are assumed to be generated by a speed pl…
▽ More
This paper proposes a control architecture for autonomous lane keeping by a vehicle. In this paper, the vehicle dynamics consist of two parts: lateral and longitudinal dynamics. Therefore, the control architecture comprises two subsequent controllers. A longitudinal model predictive control (MPC) makes the vehicle track the desired longitudinal speeds that are assumed to be generated by a speed planner. The longitudinal speeds are then passed to a lateral MPC for lane keeping. Due to the dependence of the lateral dynamics on the longitudinal speed, they are represented in a linear parameter-varying (LPV) form, where its scheduling parameter is the longitudinal speed of the vehicle. In order to deal with the imprecise information of the future longitudinal speed (the scheduling parameter), a bound of uncertainty is considered around the nominal trajectory of the future longitudinal velocities. Then, a tube-based LPV- MPC is adopted to control the lateral dynamics for attaining the lane keeping goal. In the end, the effectiveness of the proposed methods is illustrated by carrying out simulation tests.
△ Less
Submitted 6 October, 2022;
originally announced October 2022.
-
A Safe Control Architecture Based on Robust Model Predictive Control for Autonomous Driving
Authors:
Maryam Nezami,
Ngoc Thinh Nguyen,
Georg Männel,
Hossam Seddik Abbas,
Georg Schildbach
Abstract:
This paper proposes a Robust Safe Control Architecture (RSCA) for safe-decision making. The system to be controlled is a vehicle in the presence of bounded disturbances. The RSCA consists of two parts: a Supervisor MPC and a Controller MPC. Both the Supervisor and the Controller are tube MPCs (TMPCs). The Supervisor MPC provides a safety certificate for an operating controller and a backup control…
▽ More
This paper proposes a Robust Safe Control Architecture (RSCA) for safe-decision making. The system to be controlled is a vehicle in the presence of bounded disturbances. The RSCA consists of two parts: a Supervisor MPC and a Controller MPC. Both the Supervisor and the Controller are tube MPCs (TMPCs). The Supervisor MPC provides a safety certificate for an operating controller and a backup control input in every step. After an unsafe action by the operating controller is predicted, the Controller MPC takes over the system. In this paper, a method for the computation of a terminal set is proposed, which is robust against changes in road curvature and forces the vehicle to reach a safe reference. Moreover, two important proofs are provided in this paper. First, it is shown that the backup control input is safe to be applied to the system to lead the vehicle to a safe state. Next, the recursive feasibility of the RSCA is proven. By simulating some obstacle avoidance scenarios, the effectiveness of the proposed RSCA is confirmed.
△ Less
Submitted 20 June, 2022;
originally announced June 2022.
-
Leveraging Deep Neural Networks for Massive MIMO Data Detection
Authors:
Ly V. Nguyen,
Nhan T. Nguyen,
Nghi H. Tran,
Markku Juntti,
A. Lee Swindlehurst,
Duy H. N. Nguyen
Abstract:
Massive multiple-input multiple-output (MIMO) is a key technology for emerging next-generation wireless systems. Utilizing large antenna arrays at base-stations, massive MIMO enables substantial spatial multiplexing gains by simultaneously serving a large number of users. However, the complexity in massive MIMO signal processing (e.g., data detection) increases rapidly with the number of users, ma…
▽ More
Massive multiple-input multiple-output (MIMO) is a key technology for emerging next-generation wireless systems. Utilizing large antenna arrays at base-stations, massive MIMO enables substantial spatial multiplexing gains by simultaneously serving a large number of users. However, the complexity in massive MIMO signal processing (e.g., data detection) increases rapidly with the number of users, making conventional hand-engineered algorithms less computationally efficient. Low-complexity massive MIMO detection algorithms, especially those inspired or aided by deep learning, have emerged as a promising solution. While there exist many MIMO detection algorithms, the aim of this magazine paper is to provide insight into how to leverage deep neural networks (DNN) for massive MIMO detection. We review recent developments in DNN-based MIMO detection that incorporate the domain knowledge of established MIMO detection algorithms with the learning capability of DNNs. We then present a comparison of the key numerical performance metrics of these works. We conclude by describing future research areas and applications of DNNs in massive MIMO receivers.
△ Less
Submitted 11 April, 2022;
originally announced April 2022.
-
Hybrid Active-Passive Reconfigurable Intelligent Surface-Assisted Multi-User MISO Systems
Authors:
Nhan Thanh Nguyen,
Van-Dinh Nguyen,
Qingqing Wu,
Antti Tolli,
Symeon Chatzinotas,
Markku Juntti
Abstract:
We consider a multi-user multiple-input single-output (MISO) communications system which is assisted by a hybrid active-passive reconfigurable intelligent surface (RIS). Unlike conventional passive RISs, hybrid RIS is equipped with a few active elements with the ability to reflect and amplify incident signals to significantly improve the system performance. Towards a fairness-oriented design, we m…
▽ More
We consider a multi-user multiple-input single-output (MISO) communications system which is assisted by a hybrid active-passive reconfigurable intelligent surface (RIS). Unlike conventional passive RISs, hybrid RIS is equipped with a few active elements with the ability to reflect and amplify incident signals to significantly improve the system performance. Towards a fairness-oriented design, we maximize the minimum rate among all users through jointly optimizing the transmit beamforming vectors and RIS reflecting/amplifying coefficients. Combining tools from block coordinate ascent and successive convex approximation, the challenging nonconvex problem is efficiently solved by a low-complexity iterative algorithm. The numerical results show that a hybrid RIS with 4 active elements out of a total of 50 elements with a power budget of -1 dBm offers an improvement of up to 80% to the considered system, while that achieved by a fully passive RIS is only 27%.
△ Less
Submitted 18 March, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Polyphonic audio event detection: multi-label or multi-class multi-task classification problem?
Authors:
Huy Phan,
Thi Ngoc Tho Nguyen,
Philipp Koch,
Alfred Mertins
Abstract:
Polyphonic events are the main error source of audio event detection (AED) systems. In deep-learning context, the most common approach to deal with event overlaps is to treat the AED task as a multi-label classification problem. By doing this, we inherently consider multiple one-vs.-rest classification problems, which are jointly solved by a single (i.e. shared) network. In this work, to better ha…
▽ More
Polyphonic events are the main error source of audio event detection (AED) systems. In deep-learning context, the most common approach to deal with event overlaps is to treat the AED task as a multi-label classification problem. By doing this, we inherently consider multiple one-vs.-rest classification problems, which are jointly solved by a single (i.e. shared) network. In this work, to better handle polyphonic mixtures, we propose to frame the task as a multi-class classification problem by considering each possible label combination as one class. To circumvent the large number of arising classes due to combinatorial explosion, we divide the event categories into multiple groups and construct a multi-task problem in a divide-and-conquer fashion, where each of the tasks is a multi-class classification problem. A network architecture is then devised for multi-class multi-task modelling. The network is composed of a backbone subnet and multiple task-specific subnets. The task-specific subnets are designed to learn time-frequency and channel attention masks to extract features for the task at hand from the common feature maps learned by the backbone. Experiments on the TUT-SED-Synthetic-2016 with high degree of event overlap show that the proposed approach results in more favorable performance than the common multi-label approach.
△ Less
Submitted 29 January, 2022;
originally announced January 2022.
-
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays
Authors:
Thi Ngoc Tho Nguyen,
Douglas L. Jones,
Karn N. Watcharasupat,
Huy Phan,
Woon-Seng Gan
Abstract:
Polyphonic sound event localization and detection (SELD) has many practical applications in acoustic sensing and monitoring. However, the development of real-time SELD has been limited by the demanding computational requirement of most recent SELD systems. In this work, we introduce SALSA-Lite, a fast and effective feature for polyphonic SELD using microphone array inputs. SALSA-Lite is a lightwei…
▽ More
Polyphonic sound event localization and detection (SELD) has many practical applications in acoustic sensing and monitoring. However, the development of real-time SELD has been limited by the demanding computational requirement of most recent SELD systems. In this work, we introduce SALSA-Lite, a fast and effective feature for polyphonic SELD using microphone array inputs. SALSA-Lite is a lightweight variation of a previously proposed SALSA feature for polyphonic SELD. SALSA, which stands for Spatial Cue-Augmented Log-Spectrogram, consists of multichannel log-spectrograms stacked channelwise with the normalized principal eigenvectors of the spectrotemporally corresponding spatial covariance matrices. In contrast to SALSA, which uses eigenvector-based spatial features, SALSA-Lite uses normalized inter-channel phase differences as spatial features, allowing a 30-fold speedup compared to the original SALSA feature. Experimental results on the TAU-NIGENS Spatial Sound Events 2021 dataset showed that the SALSA-Lite feature achieved competitive performance compared to the full SALSA feature, and significantly outperformed the traditional feature set of multichannel log-mel spectrograms with generalized cross-correlation spectra. Specifically, using SALSA-Lite features increased localization-dependent F1 score and class-dependent localization recall by 15% and 5%, respectively, compared to using multichannel log-mel spectrograms with generalized cross-correlation spectra.
△ Less
Submitted 4 May, 2022; v1 submitted 15 November, 2021;
originally announced November 2021.
-
Switch-based Hybrid Beamforming for Wideband Multi-carrier Communications
Authors:
Mengyuan Ma,
Nhan Thanh Nguyen,
Markku Juntti
Abstract:
Switch-based hybrid beamforming (SW-HBF) architectures are promising for realizing massive multiple-input multiple-output (MIMO) communications systems because of their low cost and low power consumption. In this paper, we study the performance of SW-HBF in a wideband multi-carrier MIMO communication system considering the beam squint effect. We aim at designing the switch-based combiner that maxi…
▽ More
Switch-based hybrid beamforming (SW-HBF) architectures are promising for realizing massive multiple-input multiple-output (MIMO) communications systems because of their low cost and low power consumption. In this paper, we study the performance of SW-HBF in a wideband multi-carrier MIMO communication system considering the beam squint effect. We aim at designing the switch-based combiner that maximizes the system spectral efficiency (SE). However, the design problem is challenging because the analog combing matrix elements are binary variables. To overcome this, we propose tabu search-based (TS) SW-HBF schemes that can attain near-optimal performance with reasonable computational complexity. Furthermore, we compare the total power consumption and energy efficiency (EE) of the SW-HBF architecture to those of the phase-shifter-based hybrid beamforming (PS-HBF) architecture. Numerical simulations show that the proposed algorithms can efficiently find near-optimal solutions. Moreover, the SW-HBF scheme can significantly mitigate the beam squint effect and is less affected by the number of subcarriers than PS-HBF. It also provides improved SE and EE performance compared to PS-HBF schemes.
△ Less
Submitted 21 November, 2021; v1 submitted 12 October, 2021;
originally announced October 2021.
-
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression
Authors:
Karn N. Watcharasupat,
Thi Ngoc Tho Nguyen,
Woon-Seng Gan,
Shengkui Zhao,
Bin Ma
Abstract:
Echo and noise suppression is an integral part of a full-duplex communication system. Many recent acoustic echo cancellation (AEC) systems rely on a separate adaptive filtering module for linear echo suppression and a neural module for residual echo suppression. However, not only do adaptive filtering modules require convergence and remain susceptible to changes in acoustic environments, but this…
▽ More
Echo and noise suppression is an integral part of a full-duplex communication system. Many recent acoustic echo cancellation (AEC) systems rely on a separate adaptive filtering module for linear echo suppression and a neural module for residual echo suppression. However, not only do adaptive filtering modules require convergence and remain susceptible to changes in acoustic environments, but this two-stage framework also often introduces unnecessary delays to the AEC system when neural modules are already capable of both linear and nonlinear echo suppression. In this paper, we exploit the offset-compensating ability of complex time-frequency masks and propose an end-to-end complex-valued neural network architecture. The building block of the proposed model is a pseudocomplex extension based on the densely-connected multidilated DenseNet (D3Net) building block, resulting in a very small network of only 354K parameters. The architecture utilized the multi-resolution nature of the D3Net building blocks to eliminate the need for pooling, allowing the network to extract features using large receptive fields without any loss of output resolution. We also propose a dual-mask technique for joint echo and noise suppression with simultaneous speech enhancement. Evaluation on both synthetic and real test sets demonstrated promising results across multiple energy-based metrics and perceptual proxies.
△ Less
Submitted 22 January, 2022; v1 submitted 2 October, 2021;
originally announced October 2021.
-
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection
Authors:
Thi Ngoc Tho Nguyen,
Karn N. Watcharasupat,
Ngoc Khanh Nguyen,
Douglas L. Jones,
Woon-Seng Gan
Abstract:
Sound event localization and detection (SELD) consists of two subtasks, which are sound event detection and direction-of-arrival estimation. While sound event detection mainly relies on time-frequency patterns to distinguish different sound classes, direction-of-arrival estimation uses amplitude and/or phase differences between microphones to estimate source directions. As a result, it is often di…
▽ More
Sound event localization and detection (SELD) consists of two subtasks, which are sound event detection and direction-of-arrival estimation. While sound event detection mainly relies on time-frequency patterns to distinguish different sound classes, direction-of-arrival estimation uses amplitude and/or phase differences between microphones to estimate source directions. As a result, it is often difficult to jointly optimize these two subtasks. We propose a novel feature called Spatial cue-Augmented Log-SpectrogrAm (SALSA) with exact time-frequency mapping between the signal power and the source directional cues, which is crucial for resolving overlapping sound sources. The SALSA feature consists of multichannel log-spectrograms stacked along with the normalized principal eigenvector of the spatial covariance matrix at each corresponding time-frequency bin. Depending on the microphone array format, the principal eigenvector can be normalized differently to extract amplitude and/or phase differences between the microphones. As a result, SALSA features are applicable for different microphone array formats such as first-order ambisonics (FOA) and multichannel microphone array (MIC). Experimental results on the TAU-NIGENS Spatial Sound Events 2021 dataset with directional interferences showed that SALSA features outperformed other state-of-the-art features. Specifically, the use of SALSA features in the FOA format increased the F1 score and localization recall by 6% each, compared to the multichannel log-mel spectrograms with intensity vectors. For the MIC format, using SALSA features increased F1 score and localization recall by 16% and 7%, respectively, compared to using multichannel log-mel spectrograms with generalized cross-correlation spectra.
△ Less
Submitted 6 June, 2022; v1 submitted 1 October, 2021;
originally announced October 2021.
-
Low-Latency and Secure Computation Offloading Assisted by Hybrid Relay-Reflecting Intelligent Surface
Authors:
Khac-Hoang Ngo,
Nhan Thanh Nguyen,
Thinh Quang Dinh,
Trong-Minh Hoang,
Markku Juntti
Abstract:
Recently, the hybrid relay-reflecting intelligent surface (HRRIS) has been introduced as a spectral- and energy-efficient architecture to assist wireless communication systems. In the HRRIS, a single or few active relay elements are deployed along with a large number of passive reflecting elements, allowing it to not only reflect but also amplify the incident signals. In this work, we investigate…
▽ More
Recently, the hybrid relay-reflecting intelligent surface (HRRIS) has been introduced as a spectral- and energy-efficient architecture to assist wireless communication systems. In the HRRIS, a single or few active relay elements are deployed along with a large number of passive reflecting elements, allowing it to not only reflect but also amplify the incident signals. In this work, we investigate the potential of the HRRIS in aiding the computation offloading in a single-user mobile edge computing system. The objective is to minimize the offloading latency while ensuring the secrecy of user data against a malicious eavesdropper. We develop efficient solutions to this latency minimization problem based on alternating optimization. Through numerical results, we show that the deployment of the HRRIS can result in a considerable reduction in latency. Furthermore, the latency reduction gain offered by the HRRIS is much more significant than that of the conventional reconfigurable intelligent surface (RIS).
△ Less
Submitted 3 September, 2021;
originally announced September 2021.
-
Closed-Form Hybrid Beamforming Solution for Spectral Efficiency Upper Bound Maximization in mmWave MIMO-OFDM Systems
Authors:
Mengyuan Ma,
Nhan Thanh Nguyen,
Markku Juntti
Abstract:
Hybrid beamforming is considered a key enabler to realize millimeter wave (mmWave) multiple-input multiple-output (MIMO) communications due to its capability of considerably reducing the number of costly and power-hungry radio frequency chains in the transceiver. However, in mmWave MIMO orthogonal frequency-division multiplexing (MIMO-OFDM) systems, hybrid beamforming design is challenging because…
▽ More
Hybrid beamforming is considered a key enabler to realize millimeter wave (mmWave) multiple-input multiple-output (MIMO) communications due to its capability of considerably reducing the number of costly and power-hungry radio frequency chains in the transceiver. However, in mmWave MIMO orthogonal frequency-division multiplexing (MIMO-OFDM) systems, hybrid beamforming design is challenging because the analog precoder and combiner are required to be shared across the whole employed bandwidth. In this paper, we propose closed-form solutions to the problem of designing the analog precoder/combiner in a mmWave MIMO-OFDM system by maximizing the upper bound of the spectral efficiency. The closed-form solutions facilitate the design of analog beamformers while guaranteeing state-of-art performance. Numerical results show that the proposed algorithm attains a slightly improved performance with much lower computational complexity compared to the considered benchmarks.
△ Less
Submitted 24 August, 2021; v1 submitted 15 August, 2021;
originally announced August 2021.
-
Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning
Authors:
Karn N. Watcharasupat,
Thi Ngoc Tho Nguyen,
Ngoc Khanh Nguyen,
Zhen Jian Lee,
Douglas L. Jones,
Woon Seng Gan
Abstract:
The Sørensen--Dice Coefficient has recently seen rising popularity as a loss function (also known as Dice loss) due to its robustness in tasks where the number of negative samples significantly exceeds that of positive samples, such as semantic segmentation, natural language processing, and sound event detection. Conventional training of polyphonic sound event detection systems with binary cross-e…
▽ More
The Sørensen--Dice Coefficient has recently seen rising popularity as a loss function (also known as Dice loss) due to its robustness in tasks where the number of negative samples significantly exceeds that of positive samples, such as semantic segmentation, natural language processing, and sound event detection. Conventional training of polyphonic sound event detection systems with binary cross-entropy loss often results in suboptimal detection performance as the training is often overwhelmed by updates from negative samples. In this paper, we investigated the effect of the Dice loss, intra- and inter-modal transfer learning, data augmentation, and recording formats, on the performance of polyphonic sound event detection systems with multichannel inputs. Our analysis showed that polyphonic sound event detection systems trained with Dice loss consistently outperformed those trained with cross-entropy loss across different training settings and recording formats in terms of F1 score and error rate. We achieved further performance gains via the use of transfer learning and an appropriate combination of different data augmentation techniques.
△ Less
Submitted 2 October, 2021; v1 submitted 22 July, 2021;
originally announced July 2021.
-
What Makes Sound Event Localization and Detection Difficult? Insights from Error Analysis
Authors:
Thi Ngoc Tho Nguyen,
Karn N. Watcharasupat,
Zhen Jian Lee,
Ngoc Khanh Nguyen,
Douglas L. Jones,
Woon Seng Gan
Abstract:
Sound event localization and detection (SELD) is an emerging research topic that aims to unify the tasks of sound event detection and direction-of-arrival estimation. As a result, SELD inherits the challenges of both tasks, such as noise, reverberation, interference, polyphony, and non-stationarity of sound sources. Furthermore, SELD often faces an additional challenge of assigning correct corresp…
▽ More
Sound event localization and detection (SELD) is an emerging research topic that aims to unify the tasks of sound event detection and direction-of-arrival estimation. As a result, SELD inherits the challenges of both tasks, such as noise, reverberation, interference, polyphony, and non-stationarity of sound sources. Furthermore, SELD often faces an additional challenge of assigning correct correspondences between the detected sound classes and directions of arrival to multiple overlapping sound events. Previous studies have shown that unknown interferences in reverberant environments often cause major degradation in the performance of SELD systems. To further understand the challenges of the SELD task, we performed a detailed error analysis on two of our SELD systems, which both ranked second in the team category of DCASE SELD Challenge, one in 2020 and one in 2021. Experimental results indicate polyphony as the main challenge in SELD, due to the difficulty in detecting all sound events of interest. In addition, the SELD systems tend to make fewer errors for the polyphonic scenario that is dominant in the training set.
△ Less
Submitted 2 October, 2021; v1 submitted 22 July, 2021;
originally announced July 2021.
-
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection
Authors:
Thi Ngoc Tho Nguyen,
Karn Watcharasupat,
Ngoc Khanh Nguyen,
Douglas L. Jones,
Woon Seng Gan
Abstract:
Sound event localization and detection consists of two subtasks which are sound event detection and direction-of-arrival estimation. While sound event detection mainly relies on time-frequency patterns to distinguish different sound classes, direction-of-arrival estimation uses magnitude or phase differences between microphones to estimate source directions. Therefore, it is often difficult to joi…
▽ More
Sound event localization and detection consists of two subtasks which are sound event detection and direction-of-arrival estimation. While sound event detection mainly relies on time-frequency patterns to distinguish different sound classes, direction-of-arrival estimation uses magnitude or phase differences between microphones to estimate source directions. Therefore, it is often difficult to jointly train these two subtasks simultaneously. We propose a novel feature called spatial cue-augmented log-spectrogram (SALSA) with exact time-frequency mapping between the signal power and the source direction-of-arrival. The feature includes multichannel log-spectrograms stacked along with the estimated direct-to-reverberant ratio and a normalized version of the principal eigenvector of the spatial covariance matrix at each time-frequency bin on the spectrograms. Experimental results on the DCASE 2021 dataset for sound event localization and detection with directional interference showed that the deep learning-based models trained on this new feature outperformed the DCASE challenge baseline by a large margin. We combined several models with slightly different architectures that were trained on the new feature to further improve the system performances for the DCASE sound event localization and detection challenge.
△ Less
Submitted 29 June, 2021;
originally announced June 2021.
-
VinDr-SpineXR: A deep learning framework for spinal lesions detection and classification from radiographs
Authors:
Hieu T. Nguyen,
Hieu H. Pham,
Nghia T. Nguyen,
Ha Q. Nguyen,
Thang Q. Huynh,
Minh Dao,
Van Vu
Abstract:
Radiographs are used as the most important imaging tool for identifying spine anomalies in clinical practice. The evaluation of spinal bone lesions, however, is a challenging task for radiologists. This work aims at developing and evaluating a deep learning-based framework, named VinDr-SpineXR, for the classification and localization of abnormalities from spine X-rays. First, we build a large data…
▽ More
Radiographs are used as the most important imaging tool for identifying spine anomalies in clinical practice. The evaluation of spinal bone lesions, however, is a challenging task for radiologists. This work aims at developing and evaluating a deep learning-based framework, named VinDr-SpineXR, for the classification and localization of abnormalities from spine X-rays. First, we build a large dataset, comprising 10,468 spine X-ray images from 5,000 studies, each of which is manually annotated by an experienced radiologist with bounding boxes around abnormal findings in 13 categories. Using this dataset, we then train a deep learning classifier to determine whether a spine scan is abnormal and a detector to localize 7 crucial findings amongst the total 13. The VinDr-SpineXR is evaluated on a test set of 2,078 images from 1,000 studies, which is kept separate from the training set. It demonstrates an area under the receiver operating characteristic curve (AUROC) of 88.61% (95% CI 87.19%, 90.02%) for the image-level classification task and a mean average precision (mAP@0.5) of 33.56% for the lesion-level localization task. These results serve as a proof of concept and set a baseline for future research in this direction. To encourage advances, the dataset, codes, and trained deep learning models are made publicly available.
△ Less
Submitted 24 June, 2021;
originally announced June 2021.
-
Designing a Pseudo-Random Bit Generator with a Novel 5D-Hyperchaotic System
Authors:
Ngoc T. Nguyen,
Toan Q. Bui,
Ghyslain Gagnon,
Pascal Giard,
Georges Kaddoum
Abstract:
Dynamic and non-linear systems are emerging as potential candidates for random bit generation. In this context, chaotic systems, which are both dynamic and stochastic, are particularly suitable. This paper introduces a new continuous chaotic system along with its corresponding implementation, which targets field-programmable gate array (FPGA). This chaotic system has five dimensions, which exhibit…
▽ More
Dynamic and non-linear systems are emerging as potential candidates for random bit generation. In this context, chaotic systems, which are both dynamic and stochastic, are particularly suitable. This paper introduces a new continuous chaotic system along with its corresponding implementation, which targets field-programmable gate array (FPGA). This chaotic system has five dimensions, which exhibit complex chaotic dynamics, thus enabling the utilization of chaotic signals in cryptography. A mathematical analysis is presented to demonstrate the dynamic characteristics of the proposed hyperchaotic system. A novel digital implementation of the proposed system is presented. Moreover, a data scrambling circuit is implemented to eliminate the bias effect and increase the randomness of the bitstream generated from the chaotic signals. We show that the proposed random bit generator has high randomness. The generated bits successfully pass well-known statistical randomness test-suites, i.e., NIST SP800-22, Diehard, and TestU01. The ready-to-use random bit generator is deployed on a Xilinx Zynq-7000 SoC ZC702 Evaluation Kit. Experimental results show that the proposed random bit generator can achieve a maximum throughput of 6.78 Gbps, which is over 3.6 times greater than state-of-the-art designs while requiring under 4% of the resources available on the targeted FPGA.
△ Less
Submitted 18 May, 2021;
originally announced May 2021.
-
Machine Learning-based Reconfigurable Intelligent Surface-aided MIMO Systems
Authors:
Nhan Thanh Nguyen,
Ly V. Nguyen,
Thien Huynh-The,
Duy H. N. Nguyen,
A. Lee Swindlehurst,
Markku Juntti
Abstract:
Reconfigurable intelligent surface (RIS) technology has recently emerged as a spectral- and cost-efficient approach for wireless communications systems. However, existing hand-engineered schemes for passive beamforming design and optimization of RIS, such as the alternating optimization (AO) approaches, require a high computational complexity, especially for multiple-input-multiple-output (MIMO) s…
▽ More
Reconfigurable intelligent surface (RIS) technology has recently emerged as a spectral- and cost-efficient approach for wireless communications systems. However, existing hand-engineered schemes for passive beamforming design and optimization of RIS, such as the alternating optimization (AO) approaches, require a high computational complexity, especially for multiple-input-multiple-output (MIMO) systems. To overcome this challenge, we propose a low-complexity unsupervised learning scheme, referred to as learning-phase-shift neural network (LPSNet), to efficiently find the solution to the spectral efficiency maximization problem in RIS-aided MIMO systems. In particular, the proposed LPSNet has an optimized input structure and requires a small number of layers and nodes to produce efficient phase shifts for the RIS. Simulation results for a 16x2 MIMO system assisted by an RIS with 40 elements show that the LPSNet achieves 97.25% of the SE provided by the AO counterpart with more than a 95% reduction in complexity.
△ Less
Submitted 1 May, 2021;
originally announced May 2021.
-
Spectral Efficiency Optimization for Hybrid Relay-Reflecting Intelligent Surface
Authors:
Nhan Thanh Nguyen,
Quang-Doanh Vu,
Kyungchun Lee,
Markku Juntti
Abstract:
We propose a novel concept of hybrid relay-reflecting intelligent surface (HR-RIS), in which a single or few elements are deployed with power amplifiers (PAs) to serve as active relays, while the remaining elements only reflect the incident signals. The design and optimization of the HR-RIS is formulated in a spectral efficiency (SE) maximization problem, which is efficiently solved by the alterna…
▽ More
We propose a novel concept of hybrid relay-reflecting intelligent surface (HR-RIS), in which a single or few elements are deployed with power amplifiers (PAs) to serve as active relays, while the remaining elements only reflect the incident signals. The design and optimization of the HR-RIS is formulated in a spectral efficiency (SE) maximization problem, which is efficiently solved by the alternating optimization (AO) method. The simulation results show that a significant improvement in the SE can be attained by the proposed HR-RIS, even with a limited power budget, with respect to the conventional reconfigurable intelligent surface (RIS). In particular, the favorable design and deployment of the HR-RIS are analytically derived and numerically justified.
△ Less
Submitted 1 May, 2021;
originally announced May 2021.
-
Channel Estimation and Hybrid Architectures for RIS-Assisted Communications
Authors:
Jiguang He,
Nhan Thanh Nguyen,
Rafaela Schroeder,
Visa Tapio,
Joonas Kokkoniemi,
Markku Juntti
Abstract:
Reconfigurable intelligent surfaces (RISs) are considered as potential technologies for the upcoming sixth-generation (6G) wireless communication system. Various benefits brought by deploying one or multiple RISs include increased spectrum and energy efficiency, enhanced connectivity, extended communication coverage, reduced complexity at transceivers, and even improved localization accuracy. Howe…
▽ More
Reconfigurable intelligent surfaces (RISs) are considered as potential technologies for the upcoming sixth-generation (6G) wireless communication system. Various benefits brought by deploying one or multiple RISs include increased spectrum and energy efficiency, enhanced connectivity, extended communication coverage, reduced complexity at transceivers, and even improved localization accuracy. However, to unleash their full potential, fundamentals related to RISs, ranging from physical-layer (PHY) modelling to RIS phase control, need to be addressed thoroughly. In this paper, we provide an overview of some timely research problems related to the RIS technology, i.e., PHY modelling (including also physics), channel estimation, potential RIS architectures, and RIS phase control (via both model-based and data-driven approaches), along with recent numerical results. We envision that more efforts will be devoted towards intelligent wireless environments, enabled by RISs.
△ Less
Submitted 14 April, 2021;
originally announced April 2021.
-
A clinical validation of VinDr-CXR, an AI system for detecting abnormal chest radiographs
Authors:
Ngoc Huy Nguyen,
Ha Quy Nguyen,
Nghia Trung Nguyen,
Thang Viet Nguyen,
Hieu Huy Pham,
Tuan Ngoc-Minh Nguyen
Abstract:
Computer-Aided Diagnosis (CAD) systems for chest radiographs using artificial intelligence (AI) have recently shown a great potential as a second opinion for radiologists. The performances of such systems, however, were mostly evaluated on a fixed dataset in a retrospective manner and, thus, far from the real performances in clinical practice. In this work, we demonstrate a mechanism for validatin…
▽ More
Computer-Aided Diagnosis (CAD) systems for chest radiographs using artificial intelligence (AI) have recently shown a great potential as a second opinion for radiologists. The performances of such systems, however, were mostly evaluated on a fixed dataset in a retrospective manner and, thus, far from the real performances in clinical practice. In this work, we demonstrate a mechanism for validating an AI-based system for detecting abnormalities on X-ray scans, VinDr-CXR, at the Phu Tho General Hospital - a provincial hospital in the North of Vietnam. The AI system was directly integrated into the Picture Archiving and Communication System (PACS) of the hospital after being trained on a fixed annotated dataset from other sources. The performance of the system was prospectively measured by matching and comparing the AI results with the radiology reports of 6,285 chest X-ray examinations extracted from the Hospital Information System (HIS) over the last two months of 2020. The normal/abnormal status of a radiology report was determined by a set of rules and served as the ground truth. Our system achieves an F1 score - the harmonic average of the recall and the precision - of 0.653 (95% CI 0.635, 0.671) for detecting any abnormalities on chest X-rays. Despite a significant drop from the in-lab performance, this result establishes a high level of confidence in applying such a system in real-life situations.
△ Less
Submitted 6 April, 2021; v1 submitted 5 April, 2021;
originally announced April 2021.
-
Hybrid Relay-Reflecting Intelligent Surface-Aided Wireless Communications: Opportunities, Challenges, and Future Perspectives
Authors:
Nhan Thanh Nguyen,
Jiguang He,
Van-Dinh Nguyen,
Henk Wymeersch,
Derrick Wing Kwan Ng,
Robert Schober,
Symeon Chatzinotas,
Markku Juntti
Abstract:
Reconfigurable intelligent surfaces (RISs) have emerged as a cost- and energy-efficient technology that can customize and program the physical propagation environment by reflecting radio waves in preferred directions. However, the purely passive reflection of RISs not only limits the end-to-end channel beamforming gains, but also hinders the acquisition of accurate channel state information for th…
▽ More
Reconfigurable intelligent surfaces (RISs) have emerged as a cost- and energy-efficient technology that can customize and program the physical propagation environment by reflecting radio waves in preferred directions. However, the purely passive reflection of RISs not only limits the end-to-end channel beamforming gains, but also hinders the acquisition of accurate channel state information for the phase control at RISs. In this paper, we provide an overview of a hybrid relay-reflecting intelligent surface (HR-RIS) architecture, in which only a few elements are active and connected to power amplifiers and radio frequency chains. The introduction of a small number of active elements enables a remarkable system performance improvement which can also compensate for losses due to hardware impairments such as the deployment of limited-resolution phase shifters. Particularly, the active processing facilitates efficient channel estimation and localization at HR-RISs. We present two practical architectures for HR-RISs, namely, fixed and dynamic HR-RISs, and discuss their applications to beamforming, channel estimation, and localization. The benefits, key challenges, and future research directions for HR-RIS-aided communications are also highlighted. Numerical results for an exemplary deployment scenario show that HR-RISs with only four active elements can attain up to 42.8 percent and 41.8 percent improvement in spectral efficiency and energy efficiency, respectively, compared with conventional RISs.
△ Less
Submitted 18 June, 2021; v1 submitted 5 April, 2021;
originally announced April 2021.
-
Hybrid Relay-Reflecting Intelligent Surface-Assisted Wireless Communication
Authors:
Nhan Thanh Nguyen,
Quang-Doanh Vu,
Kyungchun Lee,
Markku Juntti
Abstract:
Reconfigurable intelligent surface (RIS) has emerged as a cost- and energy-efficient solution to enhance the wireless communication capacity. However, recent studies show that a very large surface is required for a RIS-assisted communication system; otherwise, they may be outperformed by the conventional relay. Furthermore, the performance gain of a RIS can be considerably degraded by hardware imp…
▽ More
Reconfigurable intelligent surface (RIS) has emerged as a cost- and energy-efficient solution to enhance the wireless communication capacity. However, recent studies show that a very large surface is required for a RIS-assisted communication system; otherwise, they may be outperformed by the conventional relay. Furthermore, the performance gain of a RIS can be considerably degraded by hardware impairments such as limited-resolution phase shifters. To overcome those challenges, we propose a novel concept of hybrid relay-reflecting intelligent surface (HR-RIS), in which a single or few elements are deployed with power amplifiers (PAs) to serve as active relays, while the remaining elements only reflect the incident signals. Two architectures are proposed, including the fixed and dynamic HR-RIS. Their coefficient matrices are obtained based on alternating optimization (AO) and power allocation strategies, which enable understanding the fundamental performances of RIS and relaying-based systems with a trade-off between the two. The simulation results show that a significant improvement in both the spectral efficiency (SE) and energy efficiency (EE) with respect to the conventional RIS-aided system can be attained by the proposed schemes, especially, by the dynamic HR-RIS. In particular, the favorable design and deployment of the HR-RIS are analytically derived and numerically justified.
△ Less
Submitted 5 March, 2021;
originally announced March 2021.
-
VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations
Authors:
Ha Q. Nguyen,
Khanh Lam,
Linh T. Le,
Hieu H. Pham,
Dat Q. Tran,
Dung B. Nguyen,
Dung D. Le,
Chi M. Pham,
Hang T. T. Tong,
Diep H. Dinh,
Cuong D. Do,
Luu T. Doan,
Cuong N. Nguyen,
Binh T. Nguyen,
Que V. Nguyen,
Au D. Hoang,
Hien N. Phan,
Anh T. Nguyen,
Phuong H. Ho,
Dat T. Ngo,
Nghia T. Nguyen,
Nhan T. Nguyen,
Minh Dao,
Van Vu
Abstract:
Most of the existing chest X-ray datasets include labels from a list of findings without specifying their locations on the radiographs. This limits the development of machine learning algorithms for the detection and localization of chest abnormalities. In this work, we describe a dataset of more than 100,000 chest X-ray scans that were retrospectively collected from two major hospitals in Vietnam…
▽ More
Most of the existing chest X-ray datasets include labels from a list of findings without specifying their locations on the radiographs. This limits the development of machine learning algorithms for the detection and localization of chest abnormalities. In this work, we describe a dataset of more than 100,000 chest X-ray scans that were retrospectively collected from two major hospitals in Vietnam. Out of this raw data, we release 18,000 images that were manually annotated by a total of 17 experienced radiologists with 22 local labels of rectangles surrounding abnormalities and 6 global labels of suspected diseases. The released dataset is divided into a training set of 15,000 and a test set of 3,000. Each scan in the training set was independently labeled by 3 radiologists, while each scan in the test set was labeled by the consensus of 5 radiologists. We designed and built a labeling platform for DICOM images to facilitate these annotation procedures. All images are made publicly available (https://www.physionet.org/content/vindr-cxr/1.0.0/) in DICOM format along with the labels of both the training set and the test set.
△ Less
Submitted 20 March, 2022; v1 submitted 29 December, 2020;
originally announced December 2020.
-
A General Network Architecture for Sound Event Localization and Detection Using Transfer Learning and Recurrent Neural Network
Authors:
Thi Ngoc Tho Nguyen,
Ngoc Khanh Nguyen,
Huy Phan,
Lam Pham,
Kenneth Ooi,
Douglas L. Jones,
Woon-Seng Gan
Abstract:
Polyphonic sound event detection and localization (SELD) task is challenging because it is difficult to jointly optimize sound event detection (SED) and direction-of-arrival (DOA) estimation in the same network. We propose a general network architecture for SELD in which the SELD network comprises sub-networks that are pretrained to solve SED and DOA estimation independently, and a recurrent layer…
▽ More
Polyphonic sound event detection and localization (SELD) task is challenging because it is difficult to jointly optimize sound event detection (SED) and direction-of-arrival (DOA) estimation in the same network. We propose a general network architecture for SELD in which the SELD network comprises sub-networks that are pretrained to solve SED and DOA estimation independently, and a recurrent layer that combines the SED and DOA estimation outputs into SELD outputs. The recurrent layer does the alignment between the sound classes and DOAs of sound events while being unaware of how these outputs are produced by the upstream SED and DOA estimation algorithms. This simple network architecture is compatible with different existing SED and DOA estimation algorithms. It is highly practical since the sub-networks can be improved independently. The experimental results using the DCASE 2020 SELD dataset show that the performances of our proposed network architecture using different SED and DOA estimation algorithms and different audio formats are competitive with other state-of-the-art SELD algorithms. The source code for the proposed SELD network architecture is available at Github.
△ Less
Submitted 16 November, 2020;
originally announced November 2020.
-
Enhancing MRI Brain Tumor Segmentation with an Additional Classification Network
Authors:
Hieu T. Nguyen,
Tung T. Le,
Thang V. Nguyen,
Nhan T. Nguyen
Abstract:
Brain tumor segmentation plays an essential role in medical image analysis. In recent studies, deep convolution neural networks (DCNNs) are extremely powerful to tackle tumor segmentation tasks. We propose in this paper a novel training method that enhances the segmentation results by adding an additional classification branch to the network. The whole network was trained end-to-end on the Multimo…
▽ More
Brain tumor segmentation plays an essential role in medical image analysis. In recent studies, deep convolution neural networks (DCNNs) are extremely powerful to tackle tumor segmentation tasks. We propose in this paper a novel training method that enhances the segmentation results by adding an additional classification branch to the network. The whole network was trained end-to-end on the Multimodal Brain Tumor Segmentation Challenge (BraTS) 2020 training dataset. On the BraTS's validation set, it achieved an average Dice score of 78.43%, 89.99%, and 84.22% respectively for the enhancing tumor, the whole tumor, and the tumor core.
△ Less
Submitted 28 October, 2020; v1 submitted 25 September, 2020;
originally announced September 2020.
-
Intelligent Radio Signal Processing: A Survey
Authors:
Quoc-Viet Pham,
Nhan Thanh Nguyen,
Thien Huynh-The,
Long Bao Le,
Kyungchun Lee,
Won-Joo Hwang
Abstract:
Intelligent signal processing for wireless communications is a vital task in modern wireless systems, but it faces new challenges because of network heterogeneity, diverse service requirements, a massive number of connections, and various radio characteristics. Owing to recent advancements in big data and computing technologies, artificial intelligence (AI) has become a useful tool for radio signa…
▽ More
Intelligent signal processing for wireless communications is a vital task in modern wireless systems, but it faces new challenges because of network heterogeneity, diverse service requirements, a massive number of connections, and various radio characteristics. Owing to recent advancements in big data and computing technologies, artificial intelligence (AI) has become a useful tool for radio signal processing and has enabled the realization of intelligent radio signal processing. This survey covers four intelligent signal processing topics for the wireless physical layer, including modulation classification, signal detection, beamforming, and channel estimation. In particular, each theme is presented in a dedicated section, starting with the most fundamental principles, followed by a review of up-to-date studies and a summary. To provide the necessary background, we first present a brief overview of AI techniques such as machine learning, deep learning, and federated learning. Finally, we highlight a number of research challenges and future directions in the area of intelligent radio signal processing. We expect this survey to be a good source of information for anyone interested in intelligent radio signal processing, and the perspectives we provide therein will stimulate many more novel ideas and contributions in the future.
△ Less
Submitted 3 June, 2021; v1 submitted 19 August, 2020;
originally announced August 2020.
-
A Sequence Matching Network for Polyphonic Sound Event Localization and Detection
Authors:
Thi Ngoc Tho Nguyen,
Douglas L. Jones,
Woon-Seng Gan
Abstract:
Polyphonic sound event detection and direction-of-arrival estimation require different input features from audio signals. While sound event detection mainly relies on time-frequency patterns, direction-of-arrival estimation relies on magnitude or phase differences between microphones. Previous approaches use the same input features for sound event detection and direction-of-arrival estimation, and…
▽ More
Polyphonic sound event detection and direction-of-arrival estimation require different input features from audio signals. While sound event detection mainly relies on time-frequency patterns, direction-of-arrival estimation relies on magnitude or phase differences between microphones. Previous approaches use the same input features for sound event detection and direction-of-arrival estimation, and train the two tasks jointly or in a two-stage transfer-learning manner. We propose a two-step approach that decouples the learning of the sound event detection and directional-of-arrival estimation systems. In the first step, we detect the sound events and estimate the directions-of-arrival separately to optimize the performance of each system. In the second step, we train a deep neural network to match the two output sequences of the event detector and the direction-of-arrival estimator. This modular and hierarchical approach allows the flexibility in the system design, and increase the performance of the whole sound event localization and detection system. The experimental results using the DCASE 2019 sound event localization and detection dataset show an improved performance compared to the previous state-of-the-art solutions.
△ Less
Submitted 13 February, 2020;
originally announced February 2020.
-
A two-step system for sound event localization and detection
Authors:
T. N. T. Nguyen,
D. L. Jones,
R. Ranjan,
S. Jayabalan,
W. S. Gan
Abstract:
Sound event detection and sound event localization requires different features from audio input signals. While sound event detection mainly relies on time-frequency patterns to distinguish different event classes, sound event localization uses magnitude or phase differences between microphones to estimate source directions. Therefore, we propose a two-step system to do sound event localization and…
▽ More
Sound event detection and sound event localization requires different features from audio input signals. While sound event detection mainly relies on time-frequency patterns to distinguish different event classes, sound event localization uses magnitude or phase differences between microphones to estimate source directions. Therefore, we propose a two-step system to do sound event localization and detection. In the first step, we detect the sound events and estimate the directions-of-arrival separately. In the second step, we combine the results of the event detector and direction-of-arrival estimator together. The obtained results show a significant improvement over the baseline solution for sound event localization and detection in DCASE 2019 task 3 challenge. Using the evaluation dataset, the proposed system achieved an F1 score of 93.4% for sound event detection and an error of 5.4 degrees for direction-of-arrival estimation, while the winning solution achieved an F1 score of 94.7% and an angle error of 3.7 degrees respectively.
△ Less
Submitted 26 November, 2019;
originally announced November 2019.
-
Unequally Sub-connected Architecture for Hybrid Beamforming in Massive MIMO Systems
Authors:
Nhan Thanh Nguyen,
Kyungchun Lee
Abstract:
A variety of hybrid analog-digital beamforming architectures have recently been proposed for massive multiple-input multiple-output (MIMO) systems to reduce energy consumption and the cost of implementation. In the analog processing network of these architectures, the practical sub-connected structure requires lower power consumption and hardware complexity than the fully connected structure but c…
▽ More
A variety of hybrid analog-digital beamforming architectures have recently been proposed for massive multiple-input multiple-output (MIMO) systems to reduce energy consumption and the cost of implementation. In the analog processing network of these architectures, the practical sub-connected structure requires lower power consumption and hardware complexity than the fully connected structure but cannot fully exploit the beamforming gains, which leads to a loss in overall performance. In this work, we propose a novel unequal sub-connected architecture for hybrid combining at the receiver of a massive MIMO system that employs unequal numbers of antennas in sub-antenna arrays. The optimal design of the proposed architecture is analytically derived, and includes antenna allocation and channel ordering schemes. Simulation results show that an enhancement of up to 10% can be attained in the total achievable rate by unequally assigning antennas to sub-arrays in the sub-connected system at the cost of a marginal increase in power consumption. Furthermore, in order to reduce the computational complexity involved in finding the optimal number of antennas connected to each radio frequency (RF) chain, we propose three low-complexity antenna allocation algorithms. The simulation results show that they can yield a significant reduction in complexity while achieving near-optimal performance.
△ Less
Submitted 27 August, 2019;
originally announced August 2019.