jjshoots / pyflyt Goto Github PK

View Code? Open in Web Editor NEW

93.0 4.0 17.0 249.92 MB

UAV Flight Simulator for Reinforcement Learning Research

Home Page: https://taijunjet.com/PyFlyt/documentation.html

License: MIT License

Python 99.47% Shell 0.08% Makefile 0.20% Batchfile 0.25%

drones pybullet reinforcement-learning reinforcement-learning-environments uav unmanned-aerial-vehicle

pyflyt's Issues

Multi-agent with pettingzoo API

I'm trying multi-agent PyFlyt using pettingzoo and SuperSuit.

Where can I find material to learn how to do it and where to start? I tried following the Multi-Agent Deep Reinforcement Learning in 13 Lines of Code Using PettingZoo but I get stuck on many errors such as:

assert aec_env.metadata.get("is_parallelizable", False), (
AssertionError: Converting from an AEC environment to a Parallel environment with the to_parallel wrapper is not generally safe (the AEC environment should only update once at the end of each cycle). If you have confirmed that your AEC environment can be converted in this way, then please set the `is_parallelizable` key in your metadata to True

And after adding the metada is_parallelizable to True, the following:

for pipe, proc in zip(self.pipes, self.procs):
AttributeError: 'ProcConcatVec' object has no attribute 'pipes'

Any help would be greatly appreciated.

Thanks in advance!

request to share one reinforcement learning example

Hello @jjshoots

Thank you for sharing amazing and up to date PyBullet based crazyflie repository. It is really impressive work. I have a favor to ask. Could you please share some RL example?

Many thanks.

Drone with suspended rope?

Hi,

I am interested in creating an environment where there is a rope attached to the drone and it is potentially carrying a payload. Similar to what is described in this research paper.
Do you have an idea of how to realize this with PyBullet? I have not found any established solutions on google yet.

Thanks

Questions about simulation time and wall clock time.

Hi @jjshoots,

Thank you for your excellent work! It has been very helpful for my projects.

As a new user of this repository and PyBullet, I hope my questions are not too stupid.

Q1: When running the gym_env demos, I noticed that the Real-Time Factors (RTFs) are always under 0.5. Does this indicate that the simulation time is about half as slow as the wall clock time? I suspect this might be due to camera rendering, as the RTFs are close to 1 when running simple examples from the examples folder. Additionally, I've tested this on three machines: my MacBook with an M1 CPU shows the lowest RTFs, around 0.05, while my P52 laptop with a P2000 GPU and a workstation with a 3090 GPU both show RTFs around 0.5.

I need better alignment between simulation time and wall clock time because I'm working on an imitation learning project. In imitation learning, it's crucial to collect human demonstration trajectories, generated when humans interact with an environment by sending action instructions at regular intervals. The environment changes its state according to these instructions, allowing humans to observe these changes and generate complete demonstration trajectories. When these trajectories are used in imitation learning algorithms (often involving reinforcement learning processes), ensuring the agent's interaction with the environment mirrors human interaction is essential. If the time progression in the simulation (such as PyBullet) doesn't match real-time (wall clock time), the agent and human may observe different environmental states within the same time interval, even with the same initial state and actions.

Q2: Is it possible to align the simulation time in gym envs with wall clock time? I believe the RTF should be around 1, correct?

Thank you again, and I look forward to your response.

Insights in Published Results

Hi,

Thank you for your previous answer, it was very helpful.

I am looking to hopefully use this great simulation to test a new RL method of my own. In your published paper, you reference some results of different algorithms w/ different reward structures on different envs. (See below)

For example, it would be useful to see the architecture, training params for the SAC-Dense model.

I was wondering if you had any openly available code/repo that shows how you trained these agents, as information regarding architecture, training params, etc. is limited. I would love to have a look at the code for this.

Many thanks, :)
Dan

Bug: Render doesn't simulate time correctly

In Aviary.py, I think the following line:

self._sim_elapsed += self.update_period * self.updates_per_step

should be changed to:

self._sim_elapsed += self.update_period

or the rendering will not be in sync with the simulation time and RTF won't be calculated correctly.

Make rendering GUI optional when render_mode is set to rgb_array

Make rednering GUI optional when render_mode is set to rgb_array.

It is difficult to generate images in a headless environment because we force PyBullet to spawn the GUI if a render_mode is not none.

Minor Issues in Wind Model

In aviary.py:

The following code:

        elif callable(self.wind_type):
            # custom wind field, initialize and check
            self.wind_field = self.wind_type(
                np_random=self.np_random, **self.wind_options
            )
            WindFieldClass._check_wind_field_validity(self.wind_field)
            self.wind_field = self.wind_type(
                np_random=self.np_random, **self.wind_options
            )

can be slightly optimized to be:

        elif callable(self.wind_type):
            # custom wind field, initialize and check
            self.wind_field = self.wind_type(
                np_random=self.np_random, **self.wind_options
            )
            WindFieldClass._check_wind_field_validity(self.wind_field)

as there is no need to re-initialize the wind_field.

Also, in the example inside base_wind_field.py

The following part:

        >>>         # but the z velocity varies to the log of height,
        >>>         wind[:, -1] = np.log(position[:, -1]) * self.strength

should be modified slightly to be:

        >>>         # but the z velocity varies to the log of height,
        >>>         height = np.clip(position[:, -1] + 1, 0, None)
        >>>         wind[:, -1] = np.log(height) * self.strength

Where +1 is added so that a height of 0 equals np.log(1) = 0 and avoid np.log(0) = -inf.

I also implemented this in a simple concrete wind_field, I can create a pull request with this simple wind_field if you want.

Figure out how to handle camera rendering for environments

https://github.com/jjshoots/PyFlyt/blob/master/PyFlyt/gym_envs/pyflyt_base_env.py#L138

Clarification: Motor Negative PWM

In the function "physics_update" in the file "motors.py" it is allowed to pass PWM within [-1, 1].

However, in the function "_jitted_compute_thrust_torque", I'm not sure this is handled correctly as demonstrated below.

# rpm to thrust and torque
thrust = (rpm**2) * thrust_coef * thrust_unit
torque = (rpm**2) * torque_coef * thrust_unit

The previous lines of codes will convert the negative rpm into a positive rpm thus it will also have the effect of positive thrust.

This also has been tested with a simulation, I can confirm for a quadcopter with mode '-1' a PWM value of -1 is equivalent to a PWM value of 1.

Also, I'm not sure I get the logical meaning behind it, At first I thought it's to normalize the pwm value.

If someone can clarify this further would be very thankful.

Critical Bug : Rotational Damping not applied correctly for QuadX

In quadx.py, the following line:

drag_pqr = -self.drag_coef_pqr * (np.array(self.state[0]) ** 2)

Should be modified to be:

        # simulate rotational damping
        drag_pqr = (
            -np.sign(self.state[0])
            * self.drag_coef_pqr
            * (np.array(self.state[0]) ** 2)
        )

Observation space with FlattenWaypointEnv

I think shape should be (self.attitude_shape + self.context_length * self.target_shape,)

PyFlyt/PyFlyt/gym_envs/utils/flatten_waypoint_env.py

Line 35 in b985131

low=-np.inf, high=np.inf, shape=(self.attitude_shape + self.target_shape,)

Support custom environments

great job!I would like to ask, does the project support custom environments? For example, changing the simulation environment, adding obstacles, etc.?Thanks.

Using pwm as actions in the environment

Hi! I want to use an action in the environment in the form of PWM, that is, that I can pass the action = np.array(shape(4),) in the env.step() method. QuadXBaseEnv environment. Should I set env.set_mode(-1) or (7) ? And what dimensions of actions will there be?

For some reason simulation always runs at 0.5x realtime speed

For some reason, when running the aviary with rendering enabled, the simulation is always set to 60 Hz.

Training code example question

Hello @jjshooots,

Sorry to opening up the issue again. I tried to run your code main.py and I am getting following message

wingman: --------------𓆩𓆪--------------
wingman: Using device cuda:0
wingman: Saving weights to weights/509088...
wingman: New training instance detected, generating weights directory in 3 seconds...

Could you please let me know how to run this code? Thank you for your help?

          Sure! Here's my current working setup: https://github.com/jjshoots/RLTeamAerialCombat/blob/main/src/vec_env_main.py

That said, if you're using the gymnasium environments, I would highly recommend something more standard like CleanRL.

Originally posted by @jjshoots in #55 (comment)

Question about camera Settings

Hello, I have a questions for you about the camera：

If I want to achieve obstacle avoidance with a drone, if I only have a forward-looking camera, does that mean that the speed direction of the drone must be consistent with the direction of the camera? Otherwise, if the drone moves backwards, then it will not be able to obtain the image behind it, will it not be able to avoid obstacles?

Looking forward to your reply~~

Sim-to-Real transfer

In https://pypi.org/project/PyFlyt/0.3.0/#simulation-or-hardware
It is mentioned that PyFlyt provides a way for a sim-real transfer. However, i cannot find it in github repo. Please let me know how I can find it.

BUG: QuadX Motor Saturation

In the current implementation of the handling of motor saturations in quadx.py

It is handled as follows:

  # deal with motor saturations
  if (high := np.max(self.pwm)) > 1.0:
      self.pwm /= high
  if (low := np.min(self.pwm)) < 0.05:
      self.pwm += (1.0 - self.pwm) / (1.0 - low) * (0.05 - low)

I think this implementation is not robust as it can happen that both the maximum is > 1 and the minimum is less than 0.05 at the same time and it will not be handled correctly.

A more robust implementation I think would be the following:
Have the MAX_PWM and MIN_PWM as parameters set to (1, 0.05) by default.

Use the following helper function instead of the previous lines:

@staticmethod
def _motor_saturation(pwm_values: np.ndarray, min_pwm: float, max_pwm: float):
    """
    This function saturates the motors pwm values proportionally to the min_pwm and max_pwm parameters.

    Returns:
        np.array of pwm values saturated between min_pwm and max_pwm.
    """

    max_value = np.max(pwm_values)
    min_value = np.min(pwm_values)

    result = pwm_values

    data_range = max_value - min_value

    if data_range == 0:  # Prevent division by zero if all values are equal
        result = np.clip(pwm_values, min_pwm, max_pwm)
    elif max_value > max_pwm or min_value < min_pwm:
        scale = (max_pwm - min_pwm) / data_range
        result = min_pwm + ((pwm_values - min_value) * scale)

    return result

Optionally we can have a boolean parameter HANDLE_MOTOR_SATURATION set to True by default.

I already have this implemented also, if you accept it let me know and I will create the PR.

Thanks.

Use a different flight mode

In the gym env:

env = gym.make("PyFlyt/QuadX-Hover-v1")

how would I change the QuadX drone flightmode to -1?

Thanks :)

Quad Pole Balaning -> External Camera

Hi,
for the QuadX Pole Balancing task I think it would make more sense to have an external camera, as in the fixedwing envs.

Thanks

Quad-X Propellers Axis of Rotation

Hello,

From what I can see the body frame is following FLU (Front Left Up) orientation, and the QuadX motors are configured as so:
Motor 0: Front Right
Motor 1: Back Left
Motor 2: Front Left
Motor 3: Back Right

quadx.py line 64-75

            """
            DRONE CONTROL
            motor ids correspond to quadrotor X in PX4,
            -------------------------
            |  (cw)2  0(ccw)        |
            |      \\//           x |
            |      //\\         y_| |
            | (ccw)1  3(cw)         |
            -------------------------
            using the ENU convention
            control commands are in the form of roll-pitch-yaw-thrust
        """

however the torque coefficient is written as so:

            torque_coef = np.array(
                [
                    -motor_params["torque_coef"],
                    -motor_params["torque_coef"],
                    +motor_params["torque_coef"],
                    +motor_params["torque_coef"],
                ]
            )

if I understand correctly, I think this is a mistake as Motors 0,1 should have a +ve torque_coeff (CCW about Z-Axis pointing up) and Motors 2,3 should have a -ve torque_coef (CW about Z-Axis pointing up)

Would it be possible to add obstacles in the scene?

Hi! This is a fantastic work for UAV simulation and RL learning. It would be nice if we can add some static and dynamic obstacles to increase the difficulty of task. I'm wondering if this is on the roadmap?

Change ReadMe example from v0 to v1

The readme fails since v0 is no longer support for quadx environments.

Update the readme to use v1 instead.

Comparison with gym-pybullet-drones

How does PyFlyt compare to gym-pybullet-drones? I'm looking at using one of the libraries to train a quadcopter using reinforcement learning. From what I can tell, in PyFlyt it might be easier to create custom quadcopter configurations. Motor ramping is also implemented in PyFlyt but it's still on the wishlist for gym-pybullet-drones.

Proposal: Quadplane

"""Implementation of a Quadplane UAV."""
from __future__ import annotations

import math

import numpy as np
import yaml
from pybullet_utils import bullet_client

from ..abstractions.base_drone import DroneClass


class Quadplane(DroneClass):
    """Quadplane instance that handles everything about a Quadplane."""

    def __init__(
        self,
        p: bullet_client.BulletClient,
        start_pos: np.ndarray,
        start_orn: np.ndarray,
        ctrl_hz: int,
        physics_hz: int,
        drone_model: str = "quadplane",
        model_dir: None | str = None,
        use_camera: bool = False,
        use_gimbal: bool = False,
        camera_angle_degrees: int = 0,
        camera_FOV_degrees: int = 90,
        camera_resolution: tuple[int, int] = (128, 128),
        np_random: None | np.random.RandomState = None,
    ):
        """Creates a quadplane UAV and handles all relevant control and physics.

        Args:
            p (bullet_client.BulletClient): p
            start_pos (np.ndarray): start_pos
            start_orn (np.ndarray): start_orn
            ctrl_hz (int): ctrl_hz
            physics_hz (int): physics_hz
            model_dir (None | str): model_dir
            drone_model (str): drone_model
            use_camera (bool): use_camera
            use_gimbal (bool): use_gimbal
            camera_angle_degrees (int): camera_angle_degrees
            camera_FOV_degrees (int): camera_FOV_degrees
            camera_resolution (tuple[int, int]): camera_resolution
            np_random (None | np.random.RandomState): np_random
        """
        super().__init__(
            p=p,
            start_pos=start_pos,
            start_orn=start_orn,
            ctrl_hz=ctrl_hz,
            physics_hz=physics_hz,
            model_dir=model_dir,
            drone_model=drone_model,
            np_random=np_random,
        )

        """Reads quadplane.yaml file and load UAV parameters"""
        with open(self.param_path, "rb") as f:
            # load all params from yaml
            all_params = yaml.safe_load(f)

            # Transitional parameters
            self.umin = all_params["umin"]
            self.umax = all_params["umax"]

            # Main wing
            main_wing_params = all_params["main_wing_params"]

            # Left flapped wing
            left_wing_flapped_params = all_params["left_wing_flapped_params"]

            # Right flapped wing
            right_wing_flapped_params = all_params["right_wing_flapped_params"]

            # Horizontal tail
            horizontal_tail_params = all_params["horizontal_tail_params"]

            # Vertical tail
            vertical_tail_params = all_params["vertical_tail_params"]

            # Front Motor
            front_motor_params = all_params["front_motor_params"]
            self.fm_t2w = front_motor_params["thrust_to_weight"]

            self.fm_kf = front_motor_params["thrust_const"]
            self.fm_km = front_motor_params["torque_const"]

            self.fm_thr_coeff = np.array([0.0, 1.0, 0.0]) * self.fm_kf
            self.fm_tor_coeff = np.array([0.0, 1.0, 0.0]) * self.fm_km
            self.fm_tor_dir = np.array([[1.0]])
            self.fm_noise_ratio = front_motor_params["motor_noise_ratio"]

            self.fm_max_rpm = np.sqrt((self.fm_t2w * 9.81) / (self.fm_kf))
            self.fm_motor_tau = 0.01

            # Quad Motors
            quad_motor_params = all_params["quad_motor_params"]
            self.qm_t2w = quad_motor_params["thrust_to_weight"]

            self.qm_kf = quad_motor_params["thrust_const"]
            self.qm_km = quad_motor_params["torque_const"]

            self.qm_thr_coeff = np.array([[0.0, 0.0, 1.0]]) * self.qm_kf
            self.qm_tor_coeff = np.array([[0.0, 0.0, 1.0]]) * self.qm_km
            self.qm_tor_dir = np.array([[1.0], [1.0], [-1.0], [-1.0]])
            self.qm_noise_ratio = quad_motor_params["motor_noise_ratio"]

            self.qm_max_rpm = np.sqrt((self.qm_t2w * 9.81) / (self.qm_kf))
            self.qm_motor_tau = 0.01

            # motor mapping from command to individual motors
            self.motor_map = np.array(
                [
                    [-1.0, -1.0, +1.0, +1.0],
                    [+1.0, +1.0, +1.0, +1.0],
                    [-1.0, +1.0, -1.0, +1.0],
                    [+1.0, -1.0, -1.0, +1.0],
                ]
            )

            # Combine [ail_left, ail_right, hori_tail, main_wing, vert_tail]
            self.aerofoil_params = [
                left_wing_flapped_params,
                right_wing_flapped_params,
                horizontal_tail_params,
                main_wing_params,
                vertical_tail_params,
            ]

        """ CAMERA """
        self.use_camera = use_camera
        if self.use_camera:
            self.proj_mat = self.p.computeProjectionMatrixFOV(
                fov=camera_FOV_degrees, aspect=1.0, nearVal=0.1, farVal=255.0
            )
            self.use_gimbal = use_gimbal
            self.camera_angle_degrees = camera_angle_degrees
            self.camera_FOV_degrees = camera_FOV_degrees
            self.camera_resolution = np.array(camera_resolution)

        """ CUSTOM CONTROLLERS """
        # dictionary mapping of controller_id to controller objects
        self.registered_controllers = dict()
        self.instanced_controllers = dict()
        self.registered_base_modes = dict()

        self.reset()

    def fm_rpm2forces(self, rpm_cmd):
        self.fm_rpm += (self.physics_hz / self.fm_motor_tau) * (
            self.fm_max_rpm * rpm_cmd - self.fm_rpm
        )

        rpm = np.expand_dims(self.fm_rpm, axis=1)
        thrust = (rpm**2) * self.fm_thr_coeff
        torque = (rpm**2) * self.fm_tor_coeff * self.fm_tor_dir

        # add some random noise to the motor output
        thrust += self.np_random.randn(*thrust.shape) * self.fm_noise_ratio * thrust
        torque += self.np_random.randn(*torque.shape) * self.fm_noise_ratio * torque

        self.p.applyExternalForce(
            self.Id, 0, thrust[0], [0.0, 0.0, 0.0], self.p.LINK_FRAME
        )
        # self.p.applyExternalTorque(self.Id, 0, torque[0], self.p.LINK_FRAME)

    def qm_rpm2forces(self, rpm_cmd):
        self.qm_rpm += (self.physics_hz / self.qm_motor_tau) * (
            self.qm_max_rpm * rpm_cmd - self.qm_rpm
        )

        rpm = np.expand_dims(self.qm_rpm, axis=1)
        thrust = (rpm**2) * self.qm_thr_coeff
        torque = (rpm**2) * self.qm_tor_coeff * self.qm_tor_dir

        # add some random noise to the motor outputs
        thrust += self.np_random.randn(*thrust.shape) * self.qm_noise_ratio * thrust
        torque += self.np_random.randn(*torque.shape) * self.qm_noise_ratio * torque

        for idx, (thr, tor) in enumerate(zip(thrust, torque)):
            self.p.applyExternalForce(
                self.Id, (idx + 7), thr, [0.0, 0.0, 0.0], self.p.LINK_FRAME
            )
            self.p.applyExternalTorque(self.Id, (idx + 7), tor, self.p.LINK_FRAME)

    def qm_cmd2rpm(self, Fz, Mpitch, Mroll, Myaw):
        # Mixes commands from pilot with transitional mixing (To be expanded to include lateral commands)
        qm_cmd = [Mpitch, Mroll, Myaw, Fz]
        qm_rpm_cmd = np.matmul(self.motor_map, qm_cmd)
        # deal with motor saturations
        if (high := np.max(qm_rpm_cmd)) > 1.0:
            qm_rpm_cmd /= high
        if (low := np.min(qm_rpm_cmd)) < 0.05:
            qm_rpm_cmd += (1.0 - qm_rpm_cmd) / (1.0 - low) * (0.05 - low)

        return qm_rpm_cmd

    def update_forces(self):
        Proll, eta, tau, Fz, Mpitch, Mroll, Myaw = self.cmd_mixing()

        qm_rpm = self.qm_cmd2rpm(Fz, Mpitch, Mroll, Myaw)

        # Front Motor Forces
        self.fm_rpm2forces(tau)

        # Quad Motors Forces
        self.qm_rpm2forces(qm_rpm)

        # Left aileron Forces
        self.left_aileron_forces(0, Proll)

        # Right aileron Forces
        self.right_aileron_forces(1, Proll)

        # Horizontal tail Forces
        self.horizontal_tail_forces(2, eta)

        # Main wing Forces
        self.main_wing_forces(3)

        # Vertical tail Forces
        self.vertical_tail_forces(4)

    def left_aileron_forces(self, i, Proll):
        vel = self.surface_vels[i]
        local_surface_vel = np.matmul((vel), self.rotation)

        alpha = np.arctan2(-local_surface_vel[2], local_surface_vel[1])
        alpha_deg = np.rad2deg(alpha)

        defl = self.aerofoil_params[i]["defl_lim"] * Proll
        [Cl, Cd, CM] = self.get_aero_data(self.aerofoil_params[i], defl, alpha_deg)

        freestream_speed = np.linalg.norm(
            [local_surface_vel[1], local_surface_vel[2]]
        )  # Only in Y and Z directions
        Q = 0.5 * 1.225 * np.square(freestream_speed)  # Dynamic pressure
        area = self.aerofoil_params[i]["chord"] * self.aerofoil_params[i]["span"]

        lift = Q * area * Cl
        drag = Q * area * Cd  # Negative coz front is positive
        pitching_moment = Q * area * CM * self.aerofoil_params[i]["chord"]

        up = (lift * np.cos(alpha)) + (drag * np.sin(alpha))
        front = (lift * np.sin(alpha)) - (drag * np.cos(alpha))

        self.p.applyExternalForce(
            self.Id,
            self.surface_ids[i],
            [0, front, up],
            [0.0, 0.0, 0.0],
            self.p.LINK_FRAME,
        )
        self.p.applyExternalTorque(
            self.Id, self.surface_ids[i], [pitching_moment, 0, 0], self.p.LINK_FRAME
        )

    def right_aileron_forces(self, i, Proll):
        """right_aileron_forces.

        Args:
            i:
            Proll:
        """

        vel = self.surface_vels[i]
        local_surface_vel = np.matmul((vel), self.rotation)

        alpha = np.arctan2(-local_surface_vel[2], local_surface_vel[1])
        alpha_deg = np.rad2deg(alpha)

        defl = self.aerofoil_params[i]["defl_lim"] * -Proll
        [Cl, Cd, CM] = self.get_aero_data(self.aerofoil_params[i], defl, alpha_deg)

        freestream_speed = np.linalg.norm(
            [local_surface_vel[1], local_surface_vel[2]]
        )  # Only in Y and Z directions
        Q = 0.5 * 1.225 * np.square(freestream_speed)  # Dynamic pressure
        area = self.aerofoil_params[i]["chord"] * self.aerofoil_params[i]["span"]

        lift = Q * area * Cl
        drag = Q * area * Cd
        pitching_moment = Q * area * CM * self.aerofoil_params[i]["chord"]

        up = (lift * np.cos(alpha)) + (drag * np.sin(alpha))
        front = (lift * np.sin(alpha)) - (drag * np.cos(alpha))

        self.p.applyExternalForce(
            self.Id,
            self.surface_ids[i],
            [0, front, up],
            [0.0, 0.0, 0.0],
            self.p.LINK_FRAME,
        )
        self.p.applyExternalTorque(
            self.Id, self.surface_ids[i], [pitching_moment, 0, 0], self.p.LINK_FRAME
        )

    def horizontal_tail_forces(self, i, eta):
        """horizontal_tail_forces.

        Args:
            i:
            eta:
        """

        vel = self.surface_vels[i]
        local_surface_vel = np.matmul((vel), self.rotation)

        alpha = np.arctan2(-local_surface_vel[2], local_surface_vel[1])
        alpha_deg = np.rad2deg(alpha)

        defl = self.aerofoil_params[i]["defl_lim"] * eta
        [Cl, Cd, CM] = self.get_aero_data(self.aerofoil_params[i], defl, alpha_deg)

        freestream_speed = np.linalg.norm(
            [local_surface_vel[1], local_surface_vel[2]]
        )  # Only in Y and Z directions
        Q = 0.5 * 1.225 * np.square(freestream_speed)  # Dynamic pressure
        area = self.aerofoil_params[i]["chord"] * self.aerofoil_params[i]["span"]

        lift = Q * area * Cl
        drag = Q * area * Cd
        pitching_moment = Q * area * CM * self.aerofoil_params[i]["chord"]

        up = (lift * np.cos(alpha)) + (drag * np.sin(alpha))
        front = (lift * np.sin(alpha)) - (drag * np.cos(alpha))

        self.p.applyExternalForce(
            self.Id,
            self.surface_ids[i],
            [0, front, up],
            [0.0, 0.0, 0.0],
            self.p.LINK_FRAME,
        )
        self.p.applyExternalTorque(
            self.Id, self.surface_ids[i], [pitching_moment, 0, 0], self.p.LINK_FRAME
        )

    def main_wing_forces(self, i):
        """main_wing_forces.

        Args:
            i:
        """

        vel = self.surface_vels[i]
        local_surface_vel = np.matmul((vel), self.rotation)

        alpha = np.arctan2(-local_surface_vel[2], local_surface_vel[1])
        alpha_deg = np.rad2deg(alpha)

        [Cl, Cd, CM] = self.get_aero_data(self.aerofoil_params[i], 0, alpha_deg)

        freestream_speed = np.linalg.norm(
            [local_surface_vel[1], local_surface_vel[2]]
        )  # Only in Y and Z directions
        Q = 0.5 * 1.225 * np.square(freestream_speed)  # Dynamic pressure
        area = self.aerofoil_params[i]["chord"] * self.aerofoil_params[i]["span"]

        lift = Q * area * Cl
        drag = Q * area * Cd
        pitching_moment = Q * area * CM * self.aerofoil_params[i]["chord"]

        up = (lift * np.cos(alpha)) + (drag * np.sin(alpha))
        front = (lift * np.sin(alpha)) - (drag * np.cos(alpha))

        self.p.applyExternalForce(
            self.Id, self.surface_ids[i], [0, front, up], [0, 0, 0], self.p.LINK_FRAME
        )
        self.p.applyExternalTorque(
            self.Id, self.surface_ids[i], [pitching_moment, 0, 0], self.p.LINK_FRAME
        )

    def vertical_tail_forces(self, i):
        """vertical_tail_forces.

        Args:
            i:
        """

        vel = self.surface_vels[i]
        local_surface_vel = np.matmul((vel), self.rotation)

        alpha = np.arctan2(-local_surface_vel[0], local_surface_vel[1])
        alpha_deg = np.rad2deg(alpha)

        defl = self.aerofoil_params[i]["defl_lim"] * -self.cmd[2]
        [Cl, Cd, CM] = self.get_aero_data(self.aerofoil_params[i], defl, alpha_deg)

        freestream_speed = np.linalg.norm(
            [local_surface_vel[0], local_surface_vel[1]]
        )  # Only in X and Y directions
        Q = 0.5 * 1.225 * np.square(freestream_speed)  # Dynamic pressure
        area = self.aerofoil_params[i]["chord"] * self.aerofoil_params[i]["span"]

        lift = Q * area * Cl
        drag = Q * area * Cd
        pitching_moment = Q * area * CM * self.aerofoil_params[i]["chord"]

        right = (lift * np.cos(alpha)) + (drag * np.sin(alpha))
        front = (lift * np.sin(alpha)) - (drag * np.cos(alpha))

        self.p.applyExternalForce(
            self.Id,
            self.surface_ids[i],
            [right, front, 0],
            [0.0, 0.0, 0.0],
            self.p.LINK_FRAME,
        )
        self.p.applyExternalTorque(
            self.Id, self.surface_ids[i], [0, 0, pitching_moment], self.p.LINK_FRAME
        )

    def get_aero_data(self, params, defl, alpha):
        """Returns Cl, Cd, and CM for a given aerofoil, control surface deflection, and alpha"""

        AR = params["span"] / params["chord"]
        defl = np.deg2rad(defl)
        alpha = np.deg2rad(alpha)

        Cl_alpha_3D = params["Cl_alpha_2D"] * (AR / (AR + ((2 * (AR + 4)) / (AR + 2))))

        theta_f = np.arccos((2 * params["flap_to_chord"]) - 1)
        tau = 1 - ((theta_f - np.sin(theta_f)) / np.pi)
        delta_Cl = Cl_alpha_3D * tau * params["eta"] * defl
        delta_Cl_max = params["flap_to_chord"] * delta_Cl

        alpha_stall_P_base = np.deg2rad(params["alpha_stall_P_base"])
        alpha_stall_N_base = np.deg2rad(params["alpha_stall_N_base"])

        alpha_0_base = np.deg2rad(params["alpha_0_base"])

        Cl_max_P = Cl_alpha_3D * (alpha_stall_P_base - alpha_0_base) + delta_Cl_max
        Cl_max_N = Cl_alpha_3D * (alpha_stall_N_base - alpha_0_base) + delta_Cl_max

        alpha_0 = alpha_0_base - (delta_Cl / Cl_alpha_3D)
        alpha_stall_P = alpha_0 + ((Cl_max_P) / Cl_alpha_3D)
        alpha_stall_N = alpha_0 + ((Cl_max_N) / Cl_alpha_3D)

        # Check if stalled
        if (alpha >= alpha_stall_P) or (alpha <= alpha_stall_N):
            if alpha >= alpha_stall_P:
                # Stall calculations to find alpha_i at stall
                Cl_stall = Cl_alpha_3D * (alpha_stall_P - alpha_0)
                alpha_i_at_stall = Cl_stall / (np.pi * AR)
                # alpha_i post-stall Pos
                alpha_i = np.interp(
                    alpha, [alpha_stall_P, np.pi / 2], [alpha_i_at_stall, 0]
                )

            elif alpha <= alpha_stall_N:
                # Stall calculations to find alpha_i at stall
                Cl_stall = Cl_alpha_3D * (alpha_stall_N - alpha_0)
                alpha_i_at_stall = Cl_stall / (np.pi * AR)
                # alpha_i post-stall Neg
                alpha_i = np.interp(
                    alpha, [-np.pi / 2, alpha_stall_N], [0, alpha_i_at_stall]
                )

            alpha_eff = alpha - alpha_0 - alpha_i
            # Drag coefficient at 90 deg dependent on deflection angle
            Cd_90 = (
                ((-4.26 * (10**-2)) * (defl**2))
                + ((2.1 * (10**-1)) * defl)
                + 1.98
            )
            CN = (
                Cd_90
                * np.sin(alpha_eff)
                * (
                    1 / (0.56 + 0.44 * abs(np.sin(alpha_eff)))
                    - 0.41 * (1 - np.exp(-17 / AR))
                )
            )
            CT = 0.5 * params["Cd_0"] * np.cos(alpha_eff)
            Cl = (CN * np.cos(alpha_eff)) - (CT * np.sin(alpha_eff))
            Cd = (CN * np.sin(alpha_eff)) + (CT * np.cos(alpha_eff))
            CM = -CN * (0.25 - (0.175 * (1 - ((2 * abs(alpha_eff)) / np.pi))))

        else:  # No stall
            Cl = Cl_alpha_3D * (alpha - alpha_0)
            alpha_i = Cl / (np.pi * AR)
            alpha_eff = alpha - alpha_0 - alpha_i
            CT = params["Cd_0"] * np.cos(alpha_eff)
            CN = (Cl + (CT * np.sin(alpha_eff))) / np.cos(alpha_eff)
            Cd = (CN * np.sin(alpha_eff)) + (CT * np.cos(alpha_eff))
            CM = -CN * (0.25 - (0.175 * (1 - ((2 * alpha_eff) / np.pi))))

        return Cl, Cd, CM

    def cmd_mixing(self):
        """Mixes the longitudinal commands [eta, tau] for plane commands and [Fz, M] for quad commands"""

        u = self.state[2][1]
        ctrl_share = (u - self.umin) / (self.umax - self.umin)
        # Saturate ctrl_share between [0, 1]
        ctrl_share = np.clip(ctrl_share, 0, 1)
        print(u, ctrl_share)
        # Plane command, eta
        eta = ctrl_share * -self.cmd[0]

        # Plane command, tau
        tau = ctrl_share * self.cmd[3]

        # Plane command, Proll
        Proll = ctrl_share * self.cmd[1]

        # Quad command, Fz
        Fz = (1 - ctrl_share) * self.cmd[3]

        # Quad command, Mpitch
        Mpitch = (1 - ctrl_share) * -self.cmd[0]

        # Quad command, Mroll
        Mroll = (1 - ctrl_share) * self.cmd[1]

        # Quad command, Myaw
        Myaw = (1 - ctrl_share) * -self.cmd[2]

        return Proll, eta, tau, Fz, Mpitch, Mroll, Myaw

    def update_state(self):
        """ang_vel, ang_pos, lin_vel, lin_pos"""
        lin_pos, ang_pos = self.p.getBasePositionAndOrientation(self.Id)
        lin_vel, ang_vel = self.p.getBaseVelocity(self.Id)

        # express vels in local frame
        rotation = np.array(self.p.getMatrixFromQuaternion(ang_pos)).reshape(3, 3).T
        lin_vel = np.matmul(rotation, lin_vel)
        ang_vel = np.matmul(rotation, ang_vel)

        # ang_pos in euler form
        ang_pos = self.p.getEulerFromQuaternion(ang_pos)
        self.state = np.stack([ang_vel, ang_pos, lin_vel, lin_pos], axis=0)

        # get the surface velocities and angles
        for i, id in enumerate(self.surface_ids):
            state = self.p.getLinkState(self.Id, id, computeLinkVelocity=True)
            self.surface_vels[i] = state[-2]

        self.orn = state[-3]
        self.rotation = np.array(self.p.getMatrixFromQuaternion(self.orn)).reshape(3, 3)

    def update_control(self):
        """runs through controllers"""
        # custom controllers run first if any
        if self.mode in self.registered_controllers.keys():
            custom_output = self.instanced_controllers[self.mode].step(
                self.state, self.setpoint
            )
            assert custom_output.shape == (
                4,
            ), f"custom controller outputting wrong shape, expected (4, ) but got {custom_output.shape}."

        # Final cmd, [Pitch, Roll, Yaw, Throttle] from [-1, 1]
        self.cmd = self.setpoint

    @property
    def view_mat(self):
        """view_mat."""
        # get the state of the camera on the robot
        camera_state = self.p.getLinkState(self.Id, 0)

        # UAV orientation
        orn = camera_state[1]
        rotation = np.array(self.p.getMatrixFromQuaternion(orn)).reshape(3, 3)
        cam_offset = [0, -3, 1]
        cam_offset_world_frame = np.matmul(cam_offset, rotation.transpose())

        # pose and rot
        position = np.array(camera_state[0]) + cam_offset_world_frame

        # simulate gimballed camera if needed
        up_vector = None
        if self.use_gimbal:
            # camera tilted downward for gimballed mode
            rot = np.array(self.p.getEulerFromQuaternion(camera_state[1]))
            rot[0] = 0.0
            rot[1] = self.camera_angle_degrees / 180 * math.pi
            rot = np.array(self.p.getQuaternionFromEuler(rot))
            rot = np.array(self.p.getMatrixFromQuaternion(rot)).reshape(3, 3)

            up_vector = np.matmul(rot, np.array([0.0, 0.0, 1.0]))
        else:
            # camera rotated upward for FPV mode
            rot = np.array(self.p.getEulerFromQuaternion(camera_state[1]))
            rot[0] += -self.camera_angle_degrees / 180 * math.pi
            rot = np.array(self.p.getQuaternionFromEuler(rot))
            rot = np.array(self.p.getMatrixFromQuaternion(rot)).reshape(3, 3)

            up_vector = np.matmul(rot, np.array([0, 0, 1]))

        # target position is 1000 units ahead of camera relative to the current camera pos
        target = camera_state[0]

        return self.p.computeViewMatrix(
            cameraEyePosition=position,
            cameraTargetPosition=target,
            cameraUpVector=up_vector,
        )

    def capture_image(self):
        """capture_image."""
        _, _, self.rgbaImg, self.depthImg, self.segImg = self.p.getCameraImage(
            width=self.camera_resolution[1],
            height=self.camera_resolution[0],
            viewMatrix=self.view_mat,
            projectionMatrix=self.proj_mat,
        )
        self.rgbaImg = np.array(self.rgbaImg).reshape(*self.camera_resolution, -1)
        self.depthImg = np.array(self.depthImg).reshape(*self.camera_resolution, -1)
        self.segImg = np.array(self.segImg).reshape(*self.camera_resolution, -1)

        UAV_pos = self.state[-1]
        UAV_yaw = np.rad2deg(np.arctan2(-UAV_pos[0], UAV_pos[1]))
        UAV_pitch = np.rad2deg(
            np.abs(np.arctan(UAV_pos[2] / np.linalg.norm([UAV_pos[0], UAV_pos[1]])))
        )

        self.p.resetDebugVisualizerCamera(
            cameraDistance=5,
            cameraYaw=UAV_yaw,
            cameraPitch=UAV_pitch,
            cameraTargetPosition=[0, 0, 5],
        )

        return np.array(self.rgbaImg).reshape(
            self.camera_resolution[1], self.camera_resolution[0], -1
        )

    def reset(self):
        """reset.
        """
        self.set_mode(1)
        self.setpoint = np.zeros(4)
        self.fm_rpm = np.zeros(1)
        self.qm_rpm = np.zeros(4)
        self.cmd = np.zeros(4)
        self.surface_orns = [0] * 5
        self.surface_vels = [0] * 5

        self.p.resetBasePositionAndOrientation(self.Id, self.start_pos, self.start_orn)

        self.p.resetBaseVelocity(self.Id, [0, 0, 0], [0, 0, 0])

        # [ail_left, ail_right, hori_tail, main_wing, vert_tail]
        # Maps .urdf idx to surface_ids
        self.surface_ids = [3, 4, 1, 5, 2]
        self.update_state()

        if self.use_camera:
            self.capture_image()
        pass

    def set_mode(self, mode):
        """
        Mode 1 - [Roll, Pitch, Yaw, Throttle]
        """

        # WIP, copied and pasted from quadx
        if (mode < -1 or mode > 7) and mode not in self.registered_controllers.keys():
            raise ValueError(
                f"`mode` must be between -1 and 7 or be registered in {self.registered_controllers.keys()=}, got {mode}."
            )

        self.mode = mode

        # for custom modes
        if mode in self.registered_controllers.keys():
            self.instanced_controllers[mode] = self.registered_controllers[mode]()
            mode = self.registered_base_modes[mode]

    def update_physics(self):
        """update_physics.
        """
        self.update_state()
        self.update_forces()
        pass

    def update_avionics(self):
        """
        updates state and control
        """
        self.update_control()

        if self.use_camera:
            self.capture_image()

Gymnasium Hover Environment Flight Dome

In the current implementation for the dome size it is calculated from the zero position.

I think a better implementation is to have the dome size relative to the starting (hover) point.

I already have the modification locally, if you think it's better let me know and I'll do the PR.

Flight Modes documentation

I plan to write a detailed documentation explaining each of the flight mode.

Do you have any preferred way to add this documentation ?

Thanks.

Camera moving away from drone

Hey,

I am seeing some strange behavior where after a few seconds, the onboard camera of the drone moves away, such that the drone sees itself. Here is a minimal example, only flying straight up.
Also, the drone seems to be of slow?

import numpy as np
import tqdm
import matplotlib.pyplot as plt

from PyFlyt.core import Aviary

# the starting position and orientation
start_pos_quadx = np.array([0.0, 0.0, 1.0])
start_poss = np.array([start_pos_quadx])
start_orn = np.array([np.zeros(3)])

# individual spawn options
quadx_options = dict(use_camera=True, drone_model="cf2x", camera_angle_degrees=-90, camera_FOV_degrees=130)

# environment setup
env = Aviary(
    start_pos=start_poss,
    start_orn=start_orn,
    render=True,
    drone_type=["quadx"],
    drone_options=[quadx_options],
)

# set quadx to angular velocity control
env.set_mode([0])

# prepare a window for the camera
fig, ax = plt.subplots(1, 1, figsize=(10, 5))
ax.set_title("QUADX")
plt.ioff()
fig.tight_layout()

# Initialize the image
img_quadx = ax.imshow(env.drones[0].rgbaImg, animated=True)
img_w, img_h = env.drones[0].rgbaImg.shape[0], env.drones[0].rgbaImg.shape[0] # 128, 128

# simulate for 1000 steps (1000/120 ~= 8 seconds)
for i in tqdm.tqdm(range(250)):
    env.step()
    env.set_setpoint(0, np.array([0.0, 0.0, 0.0, 1.]))
    img_quadx.set_data(env.drones[0].rgbaImg)
    fig.canvas.draw_idle()
    plt.pause(0.001)

del env

Proposal: Add SpaceX style rocket

Testing code:

import numpy as np

from PyFlyt.core import Aviary

# the starting position and orientations
start_pos = np.array([[0.0, 0.0, 5.0]])
start_orn = np.array([[1.571, 0.0, 0.0]])

# environment setup
env = Aviary(
    start_pos=start_pos,
    start_orn=start_orn,
    render=True,
    drone_type="rocket",
    drone_model="rocket",
    # use_camera=True,
)

# set to position control
env.set_mode(0)

env.drones[0].get_joint_info()

# simulate for 1000 steps (1000/120 ~= 8 seconds)
i = 0
while True:
    i += 1
    print("----------------------------------------------------")
    print(f"Fuel remaining: {env.drones[0].boosters.ratio_fuel_remaining}")
    print(f"Throttle setting: {env.drones[0].boosters.throttle_setting}")
    print(f"Ignition state: {env.drones[0].boosters.ignition_state}")
    print(f"Fuel mass: {env.getDynamicsInfo(env.drones[0].Id, 0)[0]}")
    print(f"Fuel inertia: {env.getDynamicsInfo(env.drones[0].Id, 0)[2]}")
    print(f"Rocket velocity: {env.drones[0].state[2]}")
    print(f"Rocket position: {env.drones[0].state[3]}")

    # pitch, yaw, ignition, thrust_setting, booster_gimbal_1, booster_gimbal_2
    env.drones[0].setpoint = np.array([0.0, 0.0, 1.0, 1.0, 0.0, 0.0])
    env.step()

Major Bug: PyFlyt currently not using the `URDF_USE_INERTIA_FROM_FILE` flag

PyFlyt is currently not using the URDF_USE_INERTIA_FROM_FILE flag, resulting in all the current vehicle simulations not obeying the inertia tensor defined in the URDF file. This breaks all environments, and would require a version bump for each environment.

Changing Rewards

I am using the QuadX-Waypoints env but want to scale the rewards, only the waypoint reached and crashed rewards. I downloaded the simulation through PyPi.

Can you tell me how to do this, do I call a specific variable? I can't quite figure it out.

Thanks :)

RecursionError: maximum recursion depth exceeded while calling a Python object

I am training a PyFlyt gym env using DreamerV3 from the SheepRL library. While doing so, I am running into a RecursionError: maximum recursion depth exceeded while calling a Python object error. This is likely because the Bullet env is newly instantiated for every env reset.
Any ideas why this is and how to solve it?

Issue over at SheepRL:
Eclectic-Sheep/sheeprl#306

Using video frame as an input

Thank you for the work on this awesome package! Super elegant and clean.

I have a question about the best usage of this package.

My goal would be to train a model to fly towards the targate using only video stream and drone sensor values as an input. Without relying on coordinates.
So I would have a video with an intermediate model that does object detection/tracking and the drone would need to fly towards that detected object.

Is it possible to take a video frame, apply object detection, and use it as an input for the model? What is the best to implement it using this repo?

And a side question about integration with betaflight, to train a model that uses the realistic commands from betaflight's SITL. What do you think the best approach is to do it?

[BUG] Dense reward not applied correctly in QuadXHover environment

I think the dense reward is not applied correctly in the QuadXHover environment:

https://github.com/jjshoots/PyFlyt/blob/0b4e89550680c153d8650ada6f6a58362c60f579/PyFlyt/gym_envs/quadx_envs/quadx_hover_env.py#L101C9-L110

According to the paper, the Euclidian and angle difference should be applied when the user sets sprase_reward to False. I created #6 to fix this. This problem doesn't seem to exist in other environments.

Determining Realistic Motor and Control Params

I have a quadcopter with a flight controller running ArduPilot. I'm looking to use PyFlyt to train an RL policy to complete different maneuvers with a simulated quadcopter. I then plan to transfer this policy to my actual quadcopter.

Do you have any recommendations on calculating or measuring the motor and control params that will most accurately model real life? I plan to use the primitive_drone example as a starting point. Would it be useful to incorporate the actual PID values from ArduPilot at all?

Quad waypoints

I am not sure why but I am not able to run the quadx waypoints environment with stable baselines 3. It seems to be a incompatible type (sb3 os requiring gymnasium environment, and, although quadx waypoint parent heirs Gymnasium Env, it is not accepted).
I just assumed that you used sb3. I did not find this implementation here. Could you help me ?

my main goal is to run quadx waypoints environment with drone set_mode(6)

thank you.

Proposal: Add NED Interial frame and FRD Body frame

I need to have the inertial frame follow NED and body frame follow FRD.

I already did a good progress in this, I was wondering if you have a preferred method to do so to add it to the framework.

Why I need this ? To facilitate the integration into an Ardupilot custom controller.

jjshoots / pyflyt Goto Github PK

pyflyt's Issues

Recommend Projects

Recommend Topics

Recommend Org