
custom loss function enhancer design #15

Merged · 12 commits · Sep 27, 2024

Conversation

SeppeDeWinter
Collaborator

@SeppeDeWinter SeppeDeWinter commented Sep 5, 2024

  • Fix typing syntax error
  • Allow a custom optimization function for in silico evolution
  • Change typing of the target param in EnhancerOptimizer
  • Allow target to be passed via params
  • Allow a custom optimization function for motif embedding

For example:

```python
import numpy as np
from sklearn.metrics import pairwise

from crested.tl._utils import EnhancerOptimizer

# use Heart, muscle and myoblast cells as contrast
classes_of_interest = [
    i for i, ct in enumerate(adata.obs_names)
    if "Heart" in ct or "muscle" in ct or "myoblast" in ct
]

# design enhancers that are high in heart but low in myoblast
target = np.array(
    [
        0 if "Cardiac muscle" not in x else 1
        for x in adata.obs_names
        if "Heart" in x or "muscle" in x or "myoblast" in x
    ]
)

# sanity check: the positive targets are exactly the cardiac muscle cell types
assert all(
    "Cardiac muscle" in x
    for x in adata.obs_names[np.array(classes_of_interest)[np.where(target)[0]]]
)


def L2_distance(
    mutated_predictions: np.ndarray,
    original_prediction: np.ndarray,
    target: np.ndarray,
    classes_of_interest: list[int],
) -> int:
    def scale(X):
        # min-max scale each row to [0, 1]
        return ((X.T - X.min(1)) / (X.max(1) - X.min(1))).T

    L2_sat_mut = pairwise.euclidean_distances(
        scale(mutated_predictions)[:, classes_of_interest], target.reshape(1, -1)
    )
    L2_baseline = pairwise.euclidean_distances(
        scale(original_prediction)[:, classes_of_interest], target.reshape(1, -1)
    )
    # index of the mutation that most reduces the distance to the target
    return np.argmax((L2_baseline - L2_sat_mut).squeeze())


L2_optimizer = EnhancerOptimizer(optimize_func=L2_distance)

intermediate_info_list, designed_sequences = evaluator.enhancer_design_in_silico_evolution(
    target_class=None,
    n_sequences=1,
    n_mutations=30,
    enhancer_optimizer=L2_optimizer,
    target=target,
    return_intermediate=True,
    no_mutation_flanks=(807, 807),
    classes_of_interest=classes_of_interest,
)
```
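As a quick sanity check (not part of the PR), the selection logic of the custom scorer can be exercised on toy predictions. The sketch below is a NumPy-only equivalent of `L2_distance`; `improvement_argmax` and the toy arrays are illustrative names, not CREsted API:

```python
import numpy as np

def scale(X):
    # min-max scale each row to [0, 1]
    return ((X.T - X.min(1)) / (X.max(1) - X.min(1))).T

def improvement_argmax(mutated, original, target, classes_of_interest):
    # NumPy-only equivalent of the L2_distance scorer: compare each mutated
    # prediction's distance to the target (restricted to the classes of
    # interest) against the baseline distance, and pick the biggest improvement.
    d_mut = np.linalg.norm(scale(mutated)[:, classes_of_interest] - target, axis=1)
    d_base = np.linalg.norm(scale(original)[:, classes_of_interest] - target, axis=1)
    return int(np.argmax(d_base - d_mut))

# toy example: target high in class 0, low in class 1
target = np.array([1.0, 0.0])
original = np.array([[0.2, 0.8, 0.5]])   # far from target after scaling
mutated = np.array([[0.8, 0.2, 0.5],     # mutation 0 moves toward the target
                    [0.2, 0.8, 0.5]])    # mutation 1 leaves it unchanged
best = improvement_argmax(mutated, original, target, [0, 1])  # → 0
```

The mutation that most reduces the scaled L2 distance to the target wins, which is exactly what `np.argmax((L2_baseline - L2_sat_mut).squeeze())` selects above.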

@SeppeDeWinter
Collaborator Author

Note: the code for motif embedding has not been tested yet.

@LukasMahieu
Collaborator

Okay, looks good and makes sense to me. In the near future we should really make a separate tutorial for enhancer design (including the information here), since it's currently just a one-liner in the introductory tutorial.
@erceksi could you take a look too since you implemented the original function?

@SeppeDeWinter
Collaborator Author

Added some extra changes.

Multiple sequences are now processed in parallel. Previously, model.predict was called once per sequence per iteration; now a single batched call is made per iteration.
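The batching change can be illustrated with a minimal sketch. Here `predict` is a hypothetical stand-in for `model.predict` (any function mapping a batch of one-hot sequences to per-class scores), not CREsted's actual model:

```python
import numpy as np

def predict(batch):
    # hypothetical stand-in for model.predict: (n, seq_len, 4) -> (n, n_classes)
    return batch.reshape(batch.shape[0], -1) @ np.ones((batch.shape[1] * 4, 3))

sequences = np.random.default_rng(0).random((5, 10, 4))  # 5 candidate sequences

# before: one model call per sequence per iteration
per_seq = np.vstack([predict(seq[None]) for seq in sequences])

# after: a single batched call per iteration
batched = predict(sequences)

assert np.allclose(per_seq, batched)  # same scores, far fewer model calls
```

Since each `model.predict` call carries fixed overhead, collapsing the per-sequence loop into one batched call per iteration is where the reported speedup comes from.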

@SeppeDeWinter
Collaborator Author

From a quick and dirty benchmark, this code should be around 2x faster.

@SeppeDeWinter SeppeDeWinter merged commit 81ba818 into main Sep 27, 2024
4 checks passed
@nkempynck nkempynck deleted the custom_loss_function_in_silico_evolution branch September 30, 2024 15:41