Abstract: Droplet microfluidics enables kHz screening of picoliter samples at a fraction of the cost of other high-throughput approaches. However, generating stable droplets with desired characteristics typically requires labor-intensive empirical optimization of device designs and flow conditions that limit adoption to specialist labs. Here, we compile the most comprehensive droplet dataset to date and use it to train machine learning models capable of accurately predicting device geometries and flow conditions required to generate stable aqueous-in-oil and oil-in-aqueous single and double emulsions from 15 to 250 $\mu$m at rates up to 12000 Hz for different fluids commonly used in life sciences. Novel device geometries predicted by our models for as-yet-unseen fluids yield accurate predictions, establishing their generalizability. Finally, we generate an easy-to-use design automation tool
that yield droplets within 3 $\mu$m (< 8\%) of the desired diameter, facilitating tailored droplet-based platforms for new applications and accelerating their utility in life sciences.
The GitHub repository for DAFD 3.0 can be found at: https://github.com/CIDARLAB/dafd-website