Skip to main content

Control and Robotics


Modern Control, Speech Processing & Robotics Notes

Beginner-Friendly Notes on Modern Control Topics

1. Control of Robots Through Speech Signals

1.1 Introduction

Speech-controlled robots combine speech processing, artificial intelligence, and control systems to allow robots to respond to human voice commands.

1.2 Key Components

  • Speech Signal Processing: Noise reduction and feature extraction.
  • Automatic Speech Recognition (ASR): Converts speech to text.
  • Language Understanding: Interprets text into commands.
  • Robot Control System: Executes motion using PID, LQR, MPC, etc.

1.3 Challenges

  • Noise and variability in speech
  • Real-time processing
  • Safety during robot-human interaction

1.4 Applications

  • Assistive robots
  • Industrial automation
  • Service robots

2. Autonomous Vehicles (AVs)

2.1 Introduction

Autonomous vehicles use sensors, perception algorithms, trajectory planning, and control systems to operate without human input.

2.2 Architecture

  1. Perception: Cameras, LiDAR, RADAR.
  2. Localization: GPS and SLAM.
  3. Decision & Planning: Behavior and path planning.
  4. Control: Steering, throttle, braking using PID, MPC, etc.

2.3 Challenges

  • Uncertain environments
  • Weather effects
  • Human driver interaction
  • Safety and regulations

2.4 Applications

  • Self-driving cars
  • Delivery robots
  • Autonomous public transport

3. Control for Differential Games

3.1 Introduction

Differential games study control problems involving two or more players with conflicting objectives, influencing the same dynamic system.

3.2 Basic Form

dot{x} = f(x, u, v)

3.3 Value Function

V(x,t) = inf_u sup_v J(u, v)

3.4 Hamilton–Jacobi–Isaacs (HJI) Equation

∂V/∂t + H(x, ∇V) = 0

The HJI equation describes optimal behavior in adversarial dynamic systems.

3.5 Applications

  • Pursuit–evasion games
  • Collision avoidance
  • Multi-agent autonomous driving
  • Security and defense

4. Control of Structures

4.1 Introduction

Structural control aims to reduce vibrations and improve stability in buildings, bridges, towers, and aerospace structures.

4.2 Types of Control

  • Passive Control: TMDs, base isolators.
  • Active Control: Uses actuators and sensors.
  • Semi-active Control: MR dampers and variable devices.

4.3 Control Methods

  • PID
  • LQR
  • H-infinity control
  • MPC

4.4 Applications

  • Earthquake-resistant buildings
  • Wind-resistant tall structures
  • Aerospace vibration control

5. Summary Table

Topic What It Studies Key Tools Applications
Speech-Controlled Robots Robots responding to voice commands ASR, NLP, PID, MPC Service robots, healthcare
Autonomous Vehicles Self-driving navigation & safety Perception, MPC, Trajectory Planning Cars, drones, delivery systems
Differential Games Control with competing players HJI equations, value functions Defense, multi-agent control
Structural Control Vibration suppression LQR, H∞, dampers Buildings, bridges, aircraft

Formant Control Coding (FCC) in Speech Synthesis

Formant Control Coding (FCC) is a method used in speech synthesis and speech signal processing to manipulate and control the formants of human speech. Formants are the resonant frequencies of the vocal tract that are responsible for the distinct sound quality of vowels and other speech sounds. In a speech synthesis system, controlling these formants is crucial for producing more natural-sounding synthetic speech.

What are Formants?

Formants are the key frequency bands in the vocal spectrum that shape the sound of speech. These are created by the resonance of the vocal tract when sound is produced by the vocal cords. The first two or three formants (F1, F2, and sometimes F3) are particularly important for vowel identification:

  • F1 (First Formant): Corresponds to openness of the vocal tract.
  • F2 (Second Formant): Front-back spectrum dimension.
  • F3 (Third Formant): Helps distinguish specific consonants and vowels.

In synthetic speech, formants are often modeled and synthesized artificially in order to replicate the speech characteristics of a human voice.

What is Formant Control?

In speech synthesis systems, formant control involves manipulating these formants (primarily F1, F2, and F3) to change the quality, tone, and intelligibility of synthetic speech.

Formant Control Coding (FCC) is a specialized technique for encoding, controlling, and modifying these formants for different purposes, such as improving speech quality, mimicking certain voices, or adjusting speech to different acoustic environments.

How Formant Control Coding Works

1. Formant Analysis

LPC and other spectral tools estimate human formants.

2. Formant Encoding

Formants are encoded efficiently for synthesis.

3. Formant Synthesis

Formants are used with a vocal tract model to generate speech.

4. Control Mechanism

  • Pitch control via formant shifts.
  • Voice quality changes via spectral transformation.

5. Naturalness Adjustment

FCC allows fine tuning of voice timbre, identity, emotion, etc.

Applications of Formant Control Coding

  • TTS Systems
  • Speech Synthesis in Robotics
  • Voice Transformation & Mimicry
  • Assistive Technology

Advantages of FCC

  • Improved naturalness
  • Flexibility in voice shaping
  • Better voice quality
  • Efficient encoding

Example

  • Text input: “Hello, how are you?”
  • Formants extracted then encoded.
  • Formants adjusted to produce gender/age changes.

Key Technologies Related to FCC

  • LPC
  • Harmonic Models
  • Articulatory Synthesis

Conclusion

FCC improves naturalness, flexibility, and intelligibility of synthetic speech using resonant frequency manipulation.

PID, LQR, and MPC in Robotic Control

Robotic systems use controllers to track trajectories and ensure stable behavior. Here we compare PID, LQR, and MPC.

1. PID Control

PID uses Proportional, Integral and Derivative actions:

  • P reacts to current error
  • I corrects accumulated error
  • D predicts future error

PID Equation

u(t) = Kp * e(t) + Ki * ∫ e(t') dt' + Kd * de(t)/dt

Advantages

  • Simple
  • Effective in many cases

Disadvantages

  • Tuning required
  • Poor for nonlinear systems

2. LQR Control

dot{x}(t) = A * x(t) + B * u(t)

Cost function:

J = ∫ (xแต€Qx + uแต€Ru) dt

Pros

  • Optimal
  • Works well with noise

Cons

  • Requires linear model
  • Needs accurate parameters

3. MPC Control

min ฮฃ (x - x_desired)แต€Q(x - x_desired) + uแต€Ru

Subject to:

x(k+1) = A x(k) + B u(k)

Advantages

  • Handles constraints
  • Works with nonlinear systems

Disadvantages

  • High computation
  • Requires accurate model

Comparison Table

Feature PID LQR MPC
Control Type Feedback Optimal Predictive
Constraints No No Yes
Nonlinearity Poor Poor Good

Conclusion

PID is simple but limited; LQR is optimal for linear systems; MPC is the most advanced but computationally heavy.


Further Reading


People are good at skipping over material they already know!

View Related Topics to







Contact Us

Name

Email *

Message *

Popular Posts

BER vs SNR for M-ary QAM, M-ary PSK, QPSK, BPSK, ...

๐Ÿ“˜ Overview of BER and SNR ๐Ÿงฎ Online Simulator for BER calculation of m-ary QAM and m-ary PSK ๐Ÿงฎ MATLAB Code for BER calculation of M-ary QAM, M-ary PSK, QPSK, BPSK, ... ๐Ÿ“š Further Reading ๐Ÿ“‚ View Other Topics on M-ary QAM, M-ary PSK, QPSK ... ๐Ÿงฎ Online Simulator for Constellation Diagram of m-ary QAM ๐Ÿงฎ Online Simulator for Constellation Diagram of m-ary PSK ๐Ÿงฎ MATLAB Code for BER calculation of ASK, FSK, and PSK ๐Ÿงฎ MATLAB Code for BER calculation of Alamouti Scheme ๐Ÿงฎ Different approaches to calculate BER vs SNR What is Bit Error Rate (BER)? The abbreviation BER stands for Bit Error Rate, which indicates how many corrupted bits are received (after the demodulation process) compared to the total number of bits sent in a communication process. BER = (number of bits received in error) / (total number of tran...

Constellation Diagrams of ASK, PSK, and FSK

๐Ÿ“˜ Overview of Energy per Bit (Eb / N0) ๐Ÿงฎ Online Simulator for constellation diagrams of ASK, FSK, and PSK ๐Ÿงฎ Theory behind Constellation Diagrams of ASK, FSK, and PSK ๐Ÿงฎ MATLAB Codes for Constellation Diagrams of ASK, FSK, and PSK ๐Ÿ“š Further Reading ๐Ÿ“‚ Other Topics on Constellation Diagrams of ASK, PSK, and FSK ... ๐Ÿงฎ Simulator for constellation diagrams of m-ary PSK ๐Ÿงฎ Simulator for constellation diagrams of m-ary QAM BASK (Binary ASK) Modulation: Transmits one of two signals: 0 or -√Eb, where Eb​ is the energy per bit. These signals represent binary 0 and 1.    BFSK (Binary FSK) Modulation: Transmits one of two signals: +√Eb​ ( On the y-axis, the phase shift of 90 degrees with respect to the x-axis, which is also termed phase offset ) or √Eb (on x-axis), where Eb​ is the energy per bit. These signals represent binary 0 and 1.  BPSK (Binary PSK) Modulation: Transmits one of two signals...

MATLAB Code for ASK, FSK, and PSK

๐Ÿ“˜ Overview & Theory ๐Ÿงฎ MATLAB Code for ASK ๐Ÿงฎ MATLAB Code for FSK ๐Ÿงฎ MATLAB Code for PSK ๐Ÿงฎ Simulator for binary ASK, FSK, and PSK Modulations ๐Ÿ“š Further Reading ASK, FSK & PSK HomePage MATLAB Code MATLAB Code for ASK Modulation and Demodulation % The code is written by SalimWireless.Com % Clear previous data and plots clc; clear all; close all; % Parameters Tb = 1; % Bit duration (s) fc = 10; % Carrier frequency (Hz) N_bits = 10; % Number of bits Fs = 100 * fc; % Sampling frequency (ensure at least 2*fc, more for better representation) Ts = 1/Fs; % Sampling interval samples_per_bit = Fs * Tb; % Number of samples per bit duration % Generate random binary data rng(10); % Set random seed for reproducibility binary_data = randi([0, 1], 1, N_bits); % Generate random binary data (0 or 1) % Initialize arrays for continuous signals t_overall = 0:Ts:(N_bits...

Theoretical vs. simulated BER vs. SNR for ASK, FSK, and PSK

๐Ÿ“˜ Overview ๐Ÿงฎ Simulator for calculating BER ๐Ÿงฎ MATLAB Codes for calculating theoretical BER ๐Ÿงฎ MATLAB Codes for calculating simulated BER ๐Ÿ“š Further Reading BER vs. SNR denotes how many bits in error are received for a given signal-to-noise ratio, typically measured in dB. Common noise types in wireless systems: 1. Additive White Gaussian Noise (AWGN) 2. Rayleigh Fading AWGN adds random noise; Rayleigh fading attenuates the signal variably. A good SNR helps reduce these effects. Simulator for calculating BER vs SNR for binary ASK, FSK, and PSK Calculate BER for Binary ASK Modulation Enter SNR (dB): Calculate BER Calculate BER for Binary FSK Modulation Enter SNR (dB): Calculate BER Calculate BER for Binary PSK Modulation Enter SNR (dB): Calculate BER BER vs. SNR Curves MATLAB Code for Theoretical BER % The code is written by SalimWireless.Com clc; clear; close all; % SNR va...

Comparisons among ASK, PSK, and FSK | And the definitions of each

๐Ÿ“˜ Comparisons among ASK, FSK, and PSK ๐Ÿงฎ Online Simulator for calculating Bandwidth of ASK, FSK, and PSK ๐Ÿงฎ MATLAB Code for BER vs. SNR Analysis of ASK, FSK, and PSK ๐Ÿ“š Further Reading ๐Ÿ“‚ View Other Topics on Comparisons among ASK, PSK, and FSK ... ๐Ÿงฎ Comparisons of Noise Sensitivity, Bandwidth, Complexity, etc. ๐Ÿงฎ MATLAB Code for Constellation Diagrams of ASK, FSK, and PSK ๐Ÿงฎ Online Simulator for ASK, FSK, and PSK Generation ๐Ÿงฎ Online Simulator for ASK, FSK, and PSK Constellation ๐Ÿงฎ Some Questions and Answers Modulation ASK, FSK & PSK Constellation MATLAB Simulink MATLAB Code Comparisons among ASK, PSK, and FSK    Comparisons among ASK, PSK, and FSK Comparison among ASK, FSK, and PSK Parameters ASK FSK PSK Variable Characteristics Amplitude Frequency ...

MATLAB code for BER vs SNR for M-QAM, M-PSK, QPSk, BPSK, ...

๐Ÿงฎ MATLAB Code for BPSK, M-ary PSK, and M-ary QAM Together ๐Ÿงฎ MATLAB Code for M-ary QAM ๐Ÿงฎ MATLAB Code for M-ary PSK ๐Ÿ“š Further Reading MATLAB Script for BER vs. SNR for M-QAM, M-PSK, QPSK, BPSK % Written by Salim Wireless clc; clear; close all; num_symbols = 1e5; snr_db = -20:2:20; psk_orders = [2, 4, 8, 16, 32]; qam_orders = [4, 16, 64, 256]; ber_psk_results = zeros(length(psk_orders), length(snr_db)); ber_qam_results = zeros(length(qam_orders), length(snr_db)); for i = 1:length(psk_orders) psk_order = psk_orders(i); for j = 1:length(snr_db) data_symbols = randi([0, psk_order-1], 1, num_symbols); modulated_signal = pskmod(data_symbols, psk_order, pi/psk_order); received_signal = awgn(modulated_signal, snr_db(j), 'measured'); demodulated_symbols = pskdemod(received_signal, psk_order, pi/psk_order); ber_psk_results(i, j) = sum(data_symbols ~= demodulated_symbols) / num_symbols; end end for i...

MATLAB Code for Constellation Diagrams of ASK, FSK, and PSK

๐Ÿ“˜ Overview & Theory ๐Ÿงฎ MATLAB Code for Constellation Diagrams of ASK, FSK, and PSK ๐Ÿงฎ Online Simulator for Constellation diagrams of ASK, FSK, and PSK ๐Ÿ“š Further Reading   MATLAB Script % The code is developed by SalimWireless.Com clc; clear; close all; % Parameters numSymbols = 1000; % Number of symbols to simulate symbolIndices = randi([0 1], numSymbols, 1); % Random binary symbols (0 or 1) % ASK Modulation (BASK) askAmplitude = [0, 1]; % Amplitudes for binary ASK askSymbols = askAmplitude(symbolIndices + 1); % Modulated BASK symbols % FSK Modulation (Modified BFSK with 90-degree offset) fs = 100; % Sampling frequency symbolDuration = 1; % Symbol duration in seconds t = linspace(0, symbolDuration, fs*symbolDuration); fBase = 1; % Base frequency frequencies = [fBase, fBase]; % Same frequency for both % Generate FSK symbols with 90° phase offset fskSymbols = arrayfun(@(idx) ...     cos(2*pi*frequencies(1)*t) * (1-idx) + ...     ...

What are Precoding and Combining Weights / Matrices in a MIMO Beamforming System

MIMO / Massive MIMO Beamforming Techniques Precoding and Combining Weights... Configuration of single-user digital precoder for millimeter-wave massive MIMO system Precoding and combining are two excellent ways to send and receive signals over a multi-antenna communication process, respectively (i.e., MIMO antenna communication ). The channel matrix is the basis of both the precoding and combining matrices. Precoding matrices are typically used on the transmitter side and combining matrices on the receiving side. The two matrices allow us to generate multiple simultaneous data streams between the transmitter and receiver. The nature of the data streams is also orthogonal, which helps decrease or cancel (theoretically) interference between any two data streams. For a MIMO system, the channel matrix can be effectively **diago...