Inverse design optimization of a mode converter

Inverse design optimization of a mode converter#

In this notebook, we will use inverse design and Tidy3D to create an integrated photonics component to convert a fundamental waveguide mode to a higher order mode.

[1]:

from typing import List

import autograd.numpy as anp
import matplotlib.pylab as plt
import numpy as np

# import regular tidy3d
import tidy3d as td
import tidy3d.web as web
from autograd import value_and_grad
from tidy3d.plugins.mode import ModeSolver

# set random seed to get same results
np.random.seed(2)

Setup#

We wish to recreate a device like the diagram below:

Schematic of the mode converter

A mode source is injected into a waveguide on the left-hand side. The light propagates through a rectangular region with pixelated permittivity with the value of each pixel independently tunable between 1 (vacuum) and some maximum permittivity. Finally, we measure the transmission of the light into a waveguide on the right-hand side.

The goal of the inverse design exercise is to find the best distribution of permittivities (\(\epsilon_{ij}\)) in the coupling region to maximize the power conversion between the input mode and the output mode.

We also apply our built-in smoothening and binarization filters to ensure that the final device has smooth features, and permittivity values that are all either 1, or the maximum permittivity of the waveguide material.

Parameters#

First we will define some parameters.

[2]:

# wavelength and frequency
wavelength = 1.0
freq0 = td.C_0 / wavelength
k0 = 2 * np.pi * freq0 / td.C_0

# resolution control
min_steps_per_wvl = 16
# in the design region, we set uniform grid resolution,
# and define the design parameters on the same grid
dl_design_region = 0.01

# space between boxes and PML
buffer = 1.0 * wavelength

# optimize region size
lz = td.inf
lx = 5.0
ly = 3.0

# position of source and monitor (constant for all)
source_x = -lx / 2 - buffer * 0.8
meas_x = lx / 2 + buffer * 0.8

# total size
Lx = lx + 2 * buffer
Ly = ly + 2 * buffer
Lz = 0.0

# permittivity and width of the input/output waveguide
eps_wg = 2.75
wg_width = 0.7

# random starting parameters between 0 and 1
nx = int(lx / dl_design_region)
ny = int(ly / dl_design_region)
params0 = np.random.random((nx, ny))

# frequency width and run time
freqw = freq0 / 10
run_time = 50 / freqw

Static Components#

Next, we will set up the static parts of the geometry, the input source, and the output monitor using these parameters.

[3]:

waveguide = td.Structure(
    geometry=td.Box(size=(2 * Lx, wg_width, lz)), medium=td.Medium(permittivity=eps_wg)
)

mode_size = (0, wg_width * 3, lz)

source_plane = td.Box(
    center=[source_x, 0, 0],
    size=mode_size,
)

measure_plane = td.Box(
    center=[meas_x, 0, 0],
    size=mode_size,
)

Input Structures#

Next, we write a function to return the pixelated array given our flattened tuple of permittivity values \(\epsilon_{ij}\) using the tidy3d.plugins.autograd plugin.

We start with an array of parameters between 0 and 1, apply a conic filter and tanh projection to compute smooth, well-binarized features.

[4]:

from tidy3d.plugins.autograd import make_filter_and_project, rescale

# radius of the circular filter (um) and the threshold strength
radius = 0.120
beta = 50

filter_project = make_filter_and_project(radius, lx / nx)


def get_eps(params, beta):
    """Get the permittivity values (1, eps_wg) array as a function of the parameters (0, 1)"""
    processed_params = filter_project(params, beta)
    eps = rescale(processed_params, 1, eps_wg)
    return eps


def make_input_structures(params, beta) -> List[td.Structure]:
    box = td.Box(center=(0, 0, 0), size=(lx, ly, lz))
    eps_data = get_eps(params, beta=beta).reshape((nx, ny, 1))
    custom_structure = td.Structure.from_permittivity_array(geometry=box, eps_data=eps_data)

    return [custom_structure]

Making the Simulation#

Next, we write a function to return a basic td.Simulation as a function of our parameter values.

We make sure to add the pixelated td.Structure list to input_structures but leave out the sources and monitors for now as we’ll want to add those after the mode solver is run so we can inspect them.

[5]:

def make_sim_base(params: np.ndarray, beta: float) -> td.Simulation:
    """Create the simulation given the parameters and some beta value."""

    structures = make_input_structures(params, beta=beta)
    design_region_mesh = td.MeshOverrideStructure(
        geometry=td.Box(size=(lx, ly, lz)),
        dl=[dl_design_region] * 3,
        enforce=True,
    )
    grid_spec = td.GridSpec.auto(
        wavelength=wavelength,
        min_steps_per_wvl=16,
        override_structures=[design_region_mesh],
    )

    return td.Simulation(
        size=[Lx, Ly, Lz],
        grid_spec=grid_spec,
        structures=[waveguide] + structures,
        sources=[],
        monitors=[],
        run_time=run_time,
        boundary_spec=td.BoundarySpec.pml(x=True, y=True, z=False),
    )

Visualize#

Let’s visualize the simulation to see how it looks

[6]:

sim_start = make_sim_base(params0, beta=1.0)

ax = sim_start.plot_eps(z=0, freq=freq0)

plt.show()

../_images/notebooks_Autograd3InverseDesign_11_0.png

Select Input and Output Modes#

Next, let’s visualize the first 4 mode profiles so we can select which mode indices we want to inject and transmit.

[7]:

from tidy3d.plugins.mode.web import run as run_mode_solver

num_modes = 4
mode_spec = td.ModeSpec(num_modes=num_modes)

mode_solver = ModeSolver(
    simulation=sim_start,
    plane=source_plane,
    mode_spec=td.ModeSpec(num_modes=num_modes),
    freqs=[freq0],
)
modes = run_mode_solver(mode_solver, reduce_simulation=True)

11:43:09 CEST Mode solver created with
              task_id='fdve-61ac1ec3-c51b-4d99-8207-cb968f2b9699',
              solver_id='mo-49771a0a-5121-46e6-b551-d224a3631285'.

11:43:12 CEST Mode solver status: queued

11:43:15 CEST Mode solver status: running

11:43:19 CEST Mode solver status: success

Let’s visualize the modes next.

[8]:

fig, axs = plt.subplots(num_modes, 3, figsize=(12, 12), tight_layout=True)
for mode_index in range(num_modes):
    vmax = 1.1 * max(
        abs(modes.field_components[n].sel(mode_index=mode_index)).max() for n in ("Ex", "Ey", "Ez")
    )
    for field_name, ax in zip(("Ex", "Ey", "Ez"), axs[mode_index]):
        field = modes.field_components[field_name].sel(mode_index=mode_index)
        field.real.plot(label="Real", ax=ax)
        field.imag.plot(ls="--", label="Imag", ax=ax)
        ax.set_title(f"index={mode_index}, {field_name}")
        ax.set_ylim(-vmax, vmax)

axs[0, 0].legend()

print("Effective index of computed modes: ", np.array(modes.n_eff))

Effective index of computed modes:  [[1.5720801  1.5354629  1.30322984 1.18481492]]

../_images/notebooks_Autograd3InverseDesign_15_1.png

We want to inject the fundamental, Ez-polarized input into the 1st order Ez-polarized input.

From the plots, we see that these modes correspond to the first and third rows, or mode_index=0 and mode_index=2, respectively.

So we make sure that the mode_index_in and mode_index_out variables are set appropriately and we set a ModeSpec with 3 modes to be able to capture the mode_index_out in our output data.

[9]:

mode_index_in = 0
mode_index_out = 2

num_modes = max(mode_index_in, mode_index_out) + 1

mode_spec = td.ModeSpec(num_modes=num_modes)

Then it is straightforward to generate our source and monitor.

[10]:

# source seeding the simulation
forward_source = td.ModeSource(
    source_time=td.GaussianPulse(freq0=freq0, fwidth=freqw),
    center=[source_x, 0, 0],
    size=mode_size,
    mode_index=mode_index_in,
    mode_spec=mode_spec,
    direction="+",
)

# we'll refer to the measurement monitor by this name often
measurement_monitor_name = "measurement"

# monitor where we compute the objective function from
measurement_monitor = td.ModeMonitor(
    center=[meas_x, 0, 0],
    size=mode_size,
    freqs=[freq0],
    mode_spec=mode_spec,
    name=measurement_monitor_name,
)

Finally, we create a new function that calls our make_sim_base() function and adds the source and monitor to the result. This is the function we will use in our objective function to generate our td.Simulation given the input parameters.

[11]:

def make_sim(params, beta):
    sim = make_sim_base(params, beta=beta)
    return sim.updated_copy(sources=[forward_source], monitors=[measurement_monitor])

[12]:

sim_start = make_sim_base(params0, beta=1.0)

ax = sim_start.plot_eps(z=0, freq=freq0)

plt.show()

../_images/notebooks_Autograd3InverseDesign_22_0.png

Post Processing#

Next, we will define a function to tell us how we want to postprocess the output td.SimulationData object to give the conversion power that we are interested in maximizing.

[13]:

def measure_power(sim_data: td.SimulationData) -> float:
    """Return the power in the output_data amplitude at the mode index of interest."""
    output_amps = sim_data["measurement"].amps
    amp = output_amps.sel(direction="+", f=freq0, mode_index=mode_index_out).values
    return anp.sum(anp.abs(amp) ** 2)

Then, we add a penalty to produce structures that are invariant under erosion and dilation, which is a useful approach to implementing minimum length scale features.

[14]:

from tidy3d.plugins.autograd import make_erosion_dilation_penalty

penalty = make_erosion_dilation_penalty(radius, lx / nx)

Define Objective Function#

Finally, we need to define the objective function that we want to maximize as a function of our input parameters (permittivity of each pixel) that returns the conversion power. This is the function we will differentiate later.

[ ]:

def J(params: np.ndarray, beta: float, step_num: int = None, verbose: bool = False) -> float:
    sim = make_sim(params, beta=beta)
    task_name = "inv_des"
    if step_num:
        task_name += f"_step_{step_num}"
    sim_data = web.run(sim, task_name=task_name, verbose=verbose)
    penalty_weight = np.minimum(1, beta / 25)
    density = filter_project(params, beta)
    return measure_power(sim_data) - penalty_weight * penalty(density)

Inverse Design#

Now we are ready to perform the optimization.

We use the autograd.value_and_grad function to get the gradient of J with respect to the permittivity of each Box, while also returning the converted power associated with the current iteration, so we can record this value for later.

Let’s try running this function once to make sure it works.

[16]:

dJ_fn = value_and_grad(J)

[17]:

val, grad = dJ_fn(params0, beta=1, verbose=True)
print(grad.shape)

11:43:22 CEST Created task 'inv_des' with task_id
              'fdve-a3049051-298d-400b-8005-9d357af529be' and task_type 'FDTD'.

              View task using web UI at
              'https://tidy3d.simulation.cloud/workbench?taskId=fdve-a3049051-29
              8d-400b-8005-9d357af529be'.

              Task folder: 'default'.

11:43:25 CEST Maximum FlexCredit cost: 0.025. Minimum cost depends on task
              execution details. Use 'web.real_cost(task_id)' to get the billed
              FlexCredit cost after a simulation run.

11:43:26 CEST status = success

11:43:29 CEST loading simulation from simulation_data.hdf5

11:43:33 CEST Started working on Batch containing 1 tasks.

11:43:35 CEST Maximum FlexCredit cost: 0.025 for the whole batch.

              Use 'Batch.real_cost()' to get the billed FlexCredit cost after
              the Batch has completed.

11:44:15 CEST Batch complete.

(500, 300)

[18]:

print(grad)

[[ 1.06394305e-08  2.10503245e-08  2.04421996e-08 ... -4.52559872e-09
  -4.56854173e-09 -2.29766184e-09]
 [ 2.14813263e-08  4.25093263e-08  4.12695593e-08 ... -9.07254446e-09
  -9.18642283e-09 -4.62172260e-09]
 [ 2.21538748e-08  4.38291949e-08  4.25112125e-08 ... -9.21263785e-09
  -9.41134572e-09 -4.74763104e-09]
 ...
 [ 3.40851109e-07  6.84428345e-07  6.92983184e-07 ... -6.48015089e-07
  -6.37733413e-07 -3.17189464e-07]
 [ 3.39603267e-07  6.81963317e-07  6.90605446e-07 ... -6.42102252e-07
  -6.31742331e-07 -3.14174591e-07]
 [ 1.69651307e-07  3.40655067e-07  3.44995327e-07 ... -3.20109461e-07
  -3.14909013e-07 -1.56610965e-07]]

Optimization#

We will use “Adam” optimization strategy to perform sequential updates of each of the permittivity values in the td.CustomMedium.

For more information on what we use to implement this method, see this article.

We will run 10 steps and measure both the permittivities and powers at each iteration.

We capture this process in an optimize function, which accepts various parameters that we can tweak.

[19]:

import optax

# hyperparameters
num_steps = 20
learning_rate = 1.0

# initialize adam optimizer with starting parameters
params = np.array(params0)
optimizer = optax.adam(learning_rate=learning_rate)
opt_state = optimizer.init(params)

# store history
Js = []
params_history = [params]
beta_history = []

# gradually increase the binarization strength
beta0 = 1
beta_increment = 1

for i in range(num_steps):
    # compute gradient and current objective function value

    density = filter_project(params, beta)
    plt.subplots(figsize=(2, 2))
    plt.imshow(np.flipud(1 - density.T), cmap="gray")
    plt.axis("off")
    plt.show()

    beta = beta0 + i * beta_increment
    value, gradient = dJ_fn(params, step_num=i + 1, beta=beta)

    # outputs
    print(f"step = {i + 1}")
    print(f"\tbeta = {beta:.4e}")
    print(f"\tJ = {value:.4e}")
    print(f"\tgrad_norm = {np.linalg.norm(gradient):.4e}")

    # compute and apply updates to the optimizer based on gradient (-1 sign to maximize obj_fn)
    updates, opt_state = optimizer.update(-gradient, opt_state, params)
    params[:] = optax.apply_updates(params, updates)

    # cap the parameters
    anp.clip(params, 0.0, 1.0, out=params)

    # save history
    Js.append(value)
    params_history.append(params.copy())
    beta_history.append(beta)

power = J(params_history[-1], beta=beta)
Js.append(power)

../_images/notebooks_Autograd3InverseDesign_34_0.png

step = 1
        beta = 1.0000e+00
        J = -3.9948e-02
        grad_norm = 4.5323e-04

../_images/notebooks_Autograd3InverseDesign_34_2.png

step = 2
        beta = 2.0000e+00
        J = 3.6106e-02
        grad_norm = 1.0608e-02

../_images/notebooks_Autograd3InverseDesign_34_4.png

step = 3
        beta = 3.0000e+00
        J = -4.7245e-02
        grad_norm = 7.5188e-03

../_images/notebooks_Autograd3InverseDesign_34_6.png

step = 4
        beta = 4.0000e+00
        J = -2.4368e-02
        grad_norm = 8.1877e-03

../_images/notebooks_Autograd3InverseDesign_34_8.png

step = 5
        beta = 5.0000e+00
        J = 2.0712e-01
        grad_norm = 1.5346e-02

../_images/notebooks_Autograd3InverseDesign_34_10.png

step = 6
        beta = 6.0000e+00
        J = 4.6264e-01
        grad_norm = 2.0367e-02

../_images/notebooks_Autograd3InverseDesign_34_12.png

step = 7
        beta = 7.0000e+00
        J = 5.6415e-01
        grad_norm = 1.8807e-02

../_images/notebooks_Autograd3InverseDesign_34_14.png

step = 8
        beta = 8.0000e+00
        J = 7.1625e-01
        grad_norm = 1.8309e-02

../_images/notebooks_Autograd3InverseDesign_34_16.png

step = 9
        beta = 9.0000e+00
        J = 7.2163e-01
        grad_norm = 1.8677e-02

../_images/notebooks_Autograd3InverseDesign_34_18.png

step = 10
        beta = 1.0000e+01
        J = 7.9079e-01
        grad_norm = 1.1223e-02

../_images/notebooks_Autograd3InverseDesign_34_20.png

step = 11
        beta = 1.1000e+01
        J = 8.4924e-01
        grad_norm = 7.6408e-03

../_images/notebooks_Autograd3InverseDesign_34_22.png

step = 12
        beta = 1.2000e+01
        J = 8.6322e-01
        grad_norm = 7.2300e-03

../_images/notebooks_Autograd3InverseDesign_34_24.png

step = 13
        beta = 1.3000e+01
        J = 8.7872e-01
        grad_norm = 5.2626e-03

../_images/notebooks_Autograd3InverseDesign_34_26.png

step = 14
        beta = 1.4000e+01
        J = 8.8948e-01
        grad_norm = 4.3821e-03

../_images/notebooks_Autograd3InverseDesign_34_28.png

step = 15
        beta = 1.5000e+01
        J = 9.0168e-01
        grad_norm = 3.5043e-03

../_images/notebooks_Autograd3InverseDesign_34_30.png

step = 16
        beta = 1.6000e+01
        J = 9.0462e-01
        grad_norm = 5.9954e-03

../_images/notebooks_Autograd3InverseDesign_34_32.png

step = 17
        beta = 1.7000e+01
        J = 9.0951e-01
        grad_norm = 5.4800e-03

../_images/notebooks_Autograd3InverseDesign_34_34.png

step = 18
        beta = 1.8000e+01
        J = 9.1838e-01
        grad_norm = 3.6162e-03

../_images/notebooks_Autograd3InverseDesign_34_36.png

step = 19
        beta = 1.9000e+01
        J = 9.2531e-01
        grad_norm = 2.8894e-03

../_images/notebooks_Autograd3InverseDesign_34_38.png

step = 20
        beta = 2.0000e+01
        J = 9.2823e-01
        grad_norm = 3.3079e-03

[20]:

params_final = params_history[-1]

Let’s run the optimize function.

and then record the final power value (including the last iteration’s parameter updates).

Results#

First, we plot the objective function (power converted to 1st order mode) as a function of step and notice that it converges nicely!

[21]:

plt.plot(Js)
plt.xlabel("iterations")
plt.ylabel("objective function")
plt.show()

../_images/notebooks_Autograd3InverseDesign_39_0.png

We then will visualize the final structure, so we convert it to a regular Simulation using the final permittivity values and plot it.

[22]:

sim_final = make_sim(params_final, beta=beta)

[23]:

sim_final.plot_eps(z=0, freq=freq0)
plt.show()

../_images/notebooks_Autograd3InverseDesign_42_0.png

Finally, we want to inspect the fields, so we add a field monitor to the Simulation and perform one more run to record the field values for plotting.

[24]:

field_mnt = td.FieldMonitor(
    size=(td.inf, td.inf, 0),
    freqs=[freq0],
    name="field_mnt",
)

sim_final = sim_final.copy(update=dict(monitors=(field_mnt, measurement_monitor)))

[25]:

sim_data_final = web.run(sim_final, task_name="inv_des_final")

12:13:04 CEST Created task 'inv_des_final' with task_id
              'fdve-6ebb1d0d-1f17-447d-9219-1aefb713dfef' and task_type 'FDTD'.

              View task using web UI at
              'https://tidy3d.simulation.cloud/workbench?taskId=fdve-6ebb1d0d-1f
              17-447d-9219-1aefb713dfef'.

              Task folder: 'default'.

12:13:08 CEST Maximum FlexCredit cost: 0.025. Minimum cost depends on task
              execution details. Use 'web.real_cost(task_id)' to get the billed
              FlexCredit cost after a simulation run.

12:13:09 CEST status = queued

              To cancel the simulation, use 'web.abort(task_id)' or
              'web.delete(task_id)' or abort/delete the task in the web UI.
              Terminating the Python script will not stop the job running on the
              cloud.

12:13:16 CEST starting up solver

              running solver

12:13:18 CEST early shutoff detected at 12%, exiting.

12:13:19 CEST status = postprocess

12:13:21 CEST status = success

12:13:23 CEST View simulation result at
              'https://tidy3d.simulation.cloud/workbench?taskId=fdve-6ebb1d0d-1f
              17-447d-9219-1aefb713dfef'.

12:13:26 CEST loading simulation from simulation_data.hdf5

We notice that the behavior is as expected and the device performs exactly how we intended!

[26]:

f, (ax0, ax1, ax2) = plt.subplots(1, 3, figsize=(12, 3), tight_layout=True)
sim_final.plot_eps(z=0.01, freq=freq0, ax=ax0)
ax1 = sim_data_final.plot_field("field_mnt", "Ez", z=0, ax=ax1)
ax2 = sim_data_final.plot_field("field_mnt", "E", "abs^2", z=0, ax=ax2)

../_images/notebooks_Autograd3InverseDesign_47_0.png

The final device converts more than 90% of the input power to the 1st mode, up from < 1% when we started.

[27]:

final_power = (
    sim_data_final["measurement"].amps.sel(direction="+", f=freq0, mode_index=mode_index_out).abs
    ** 2
)
print(f"Final power conversion = {final_power * 100:.2f}%")

Final power conversion = 96.23%

Export to GDS#

The Simulation object has the .to_gds_file convenience function to export the final design to a GDS file. In addition to a file name, it is necessary to set a cross-sectional plane (z = 0 in this case) on which to evaluate the geometry, a frequency to evaluate the permittivity, and a permittivity_threshold to define the shape boundaries in custom mediums. See the GDS export notebook for a detailed example on using .to_gds_file and other GDS related functions.

[28]:

sim_final.to_gds_file(
    fname="./misc/inv_des_mode_conv.gds",
    z=0,
    frequency=freq0,
    permittivity_threshold=2.6,
)