Skip to main content

Showing 1–38 of 38 results for author: Osher, S J

  1. arXiv:2406.13781  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    A Primal-Dual Framework for Transformers and Neural Networks

    Authors: Tan M. Nguyen, Tam Nguyen, Nhat Ho, Andrea L. Bertozzi, Richard G. Baraniuk, Stanley J. Osher

    Abstract: Self-attention is key to the remarkable success of transformers in sequence modeling tasks including many applications in natural language processing and computer vision. Like neural network layers, these attention mechanisms are often developed by heuristics and experience. To provide a principled framework for constructing attention layers in transformers, we show that the self-attention corresp… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to ICLR 2023, 26 pages, 4 figures, 14 tables

  2. arXiv:2402.17745  [pdf, other

    physics.comp-ph cs.CV physics.optics

    Low-light phase retrieval with implicit generative priors

    Authors: Raunak Manekar, Elisa Negrini, Minh Pham, Daniel Jacobs, Jaideep Srivastava, Stanley J. Osher, Jianwei Miao

    Abstract: Phase retrieval (PR) is fundamentally important in scientific imaging and is crucial for nanoscale techniques like coherent diffractive imaging (CDI). Low radiation dose imaging is essential for applications involving radiation-sensitive samples. However, most PR methods struggle in low-dose scenarios due to high shot noise. Recent advancements in optical data acquisition setups, such as in-situ C… ▽ More

    Submitted 23 August, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    MSC Class: 68T10 68T07 78A46

  3. arXiv:2402.06162  [pdf, other

    stat.ML cs.LG

    Wasserstein proximal operators describe score-based generative models and resolve memorization

    Authors: Benjamin J. Zhang, Siting Liu, Wuchen Li, Markos A. Katsoulakis, Stanley J. Osher

    Abstract: We focus on the fundamental mathematical structure of score-based generative models (SGMs). We first formulate SGMs in terms of the Wasserstein proximal operator (WPO) and demonstrate that, via mean-field games (MFGs), the WPO formulation reveals mathematical structure that describes the inductive bias of diffusion and score-based models. In particular, MFGs yield optimality conditions in the form… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2401.07364  [pdf, other

    cs.LG cs.AI math.NA

    PDE Generalization of In-Context Operator Networks: A Study on 1D Scalar Nonlinear Conservation Laws

    Authors: Liu Yang, Stanley J. Osher

    Abstract: Can we build a single large model for a wide range of PDE-related scientific learning tasks? Can this model generalize to new PDEs, even of new forms, without any fine-tuning? In-context operator learning and the corresponding model In-Context Operator Networks (ICON) represent an initial exploration of these questions. The capability of ICON regarding the first question has been demonstrated prev… ▽ More

    Submitted 21 January, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  5. arXiv:2310.01605  [pdf, other

    math.NA math.OC

    Primal-dual hybrid gradient algorithms for computing time-implicit Hamilton-Jacobi equations

    Authors: Tingwei Meng, Wenbo Hao, Siting Liu, Stanley J. Osher, Wuchen Li

    Abstract: Hamilton-Jacobi (HJ) partial differential equations (PDEs) have diverse applications spanning physics, optimal control, game theory, and imaging sciences. This research introduces a first-order optimization-based technique for HJ PDEs, which formulates the time-implicit update of HJ PDEs as saddle point problems. We remark that the saddle point formulation for HJ equations is aligned with the prim… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  6. arXiv:2308.05061  [pdf, other

    cs.LG math.NA stat.ML

    Fine-Tune Language Models as Multi-Modal Differential Equation Solvers

    Authors: Liu Yang, Siting Liu, Stanley J. Osher

    Abstract: In the growing domain of scientific machine learning, in-context operator learning has shown notable potential in building foundation models, as in this framework the model is trained to learn operators and solve differential equations using prompted data, during the inference stage without weight updates. However, the current model's overdependence on function data overlooks the invaluable human… ▽ More

    Submitted 1 February, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

  7. arXiv:2306.11283  [pdf

    physics.optics cond-mat.soft physics.app-ph physics.bio-ph

    Computational Microscopy beyond Perfect Lenses

    Authors: Xingyuan Lu, Minh Pham, Elisa Negrini, Damek Davis, Stanley J. Osher, Jianwei Miao

    Abstract: We demonstrate that in situ coherent diffractive imaging (CDI), which harnesses the coherent interference between a strong and a weak beam illuminating a static and dynamic structure, can be a very dose-efficient imaging method. At low doses, in situ CDI can achieve higher resolution than perfect lenses with the point spread function as a delta function. Both our numerical simulation and experimen… ▽ More

    Submitted 3 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  8. arXiv:2304.07993  [pdf, other

    cs.LG math.NA stat.ML

    In-Context Operator Learning with Data Prompts for Differential Equation Problems

    Authors: Liu Yang, Siting Liu, Tingwei Meng, Stanley J. Osher

    Abstract: This paper introduces a new neural-network-based approach, namely In-Context Operator Networks (ICON), to simultaneously learn operators from the prompted data and apply it to new questions during the inference stage, without any weight update. Existing methods are limited to using a neural network to approximate a specific equation solution or a specific operator, requiring retraining when switch… ▽ More

    Submitted 19 September, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: The second and third authors contributed equally. This is an outdated preprint. Please refer to the updated version published in PNAS: www.pnas.org/doi/10.1073/pnas.2310142120 See code in https://github.com/LiuYangMage/in-context-operator-networks

  9. arXiv:2208.00579  [pdf, other

    cs.LG math.NA

    Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization

    Authors: Tan Nguyen, Richard G. Baraniuk, Robert M. Kirby, Stanley J. Osher, Bao Wang

    Abstract: Transformers have achieved remarkable success in sequence modeling and beyond but suffer from quadratic computational and memory complexities with respect to the length of the input sequence. Leveraging techniques include sparse and linear attention and hashing tricks; efficient transformers have been proposed to reduce the quadratic complexity of transformers but significantly degrade the accurac… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: 22 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:2110.07034

    MSC Class: 65Pxx

  10. arXiv:2207.00129  [pdf, other

    cs.MA cs.CG cs.RO eess.SY math.OC

    Multi-Agent Shape Control with Optimal Transport

    Authors: Alex Tong Lin, Stanley J. Osher

    Abstract: We introduce a method called MASCOT (Multi-Agent Shape Control with Optimal Transport) to compute optimal control solutions of agents with shape/formation/density constraints. For example, we might want to apply shape constraints on the agents -- perhaps we desire the agents to hold a particular shape along the path, or we want agents to spread out in order to minimize collisions. We might also wa… ▽ More

    Submitted 3 February, 2023; v1 submitted 30 June, 2022; originally announced July 2022.

    Comments: Fixed expressions for g_shape and L_shape in section 4.1, 4.2, 5.2, and 5.3

  11. arXiv:2206.00206  [pdf, ps, other

    cs.LG stat.ML

    Transformer with Fourier Integral Attentions

    Authors: Tan Nguyen, Minh Pham, Tam Nguyen, Khai Nguyen, Stanley J. Osher, Nhat Ho

    Abstract: Multi-head attention empowers the recent success of transformers, the state-of-the-art models that have achieved remarkable success in sequence modeling and beyond. These attention mechanisms compute the pairwise dot products between the queries and keys, which results from the use of unnormalized Gaussian kernels with the assumption that the queries follow a mixture of Gaussian distribution. Ther… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: 35 pages, 5 tables. Tan Nguyen and Minh Pham contributed equally to this work

  12. arXiv:2204.08621  [pdf, other

    math.NA cs.LG

    Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs

    Authors: Justin Baker, Hedi Xia, Yiwei Wang, Elena Cherkaev, Akil Narayan, Long Chen, Jack Xin, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang

    Abstract: Learning neural ODEs often requires solving very stiff ODE systems, primarily using explicit adaptive step size ODE solvers. These solvers are computationally expensive, requiring the use of tiny step sizes for numerical stability and accuracy guarantees. This paper considers learning neural ODEs using implicit ODE solvers of different orders leveraging proximal operators. The proximal implicit so… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 20 pages, 7 figures

    MSC Class: 68T07; 65L04 ACM Class: I.2

  13. arXiv:2203.06269  [pdf, other

    cs.LG cs.CE eess.SY math.NA

    Parameter Inference of Time Series by Delay Embeddings and Learning Differentiable Operators

    Authors: Alex Tong Lin, Adrian S. Wong, Robert Martin, Stanley J. Osher, Daniel Eckhardt

    Abstract: We provide a method to identify system parameters of dynamical systems, called ID-ODE -- Inference by Differentiation and Observing Delay Embeddings. In this setting, we are given a dataset of trajectories from a dynamical system with system parameter labels. Our goal is to identify system parameters of new trajectories. The given trajectories may or may not encompass the full state of the system,… ▽ More

    Submitted 16 November, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

  14. arXiv:2110.08678  [pdf, other

    cs.LG cs.CL stat.ML

    Improving Transformers with Probabilistic Attention Keys

    Authors: Tam Nguyen, Tan M. Nguyen, Dung D. Le, Duy Khuong Nguyen, Viet-Anh Tran, Richard G. Baraniuk, Nhat Ho, Stanley J. Osher

    Abstract: Multi-head attention is a driving force behind state-of-the-art transformers, which achieve remarkable performance across a variety of natural language processing (NLP) and computer vision tasks. It has been observed that for many applications, those attention heads learn redundant embedding, and most of them can be removed without degrading the performance of the model. Inspired by this observati… ▽ More

    Submitted 12 June, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: 27 pages, 16 figures, 10 tables

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

  15. arXiv:2110.04840  [pdf, other

    cs.LG cs.AI math.DS math.NA

    Heavy Ball Neural Ordinary Differential Equations

    Authors: Hedi Xia, Vai Suliafu, Hangjie Ji, Tan M. Nguyen, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang

    Abstract: We propose heavy ball neural ordinary differential equations (HBNODEs), leveraging the continuous limit of the classical momentum accelerated gradient descent, to improve neural ODEs (NODEs) training and inference. HBNODEs have two properties that imply practical advantages over NODEs: (i) The adjoint state of an HBNODE also satisfies an HBNODE, accelerating both forward and backward ODE solvers,… ▽ More

    Submitted 10 October, 2021; originally announced October 2021.

    Comments: 23 pages, 9 figures, Accepted for publication at Advances in Neural Information Processing Systems (NeurIPS) 2021

    MSC Class: 68T07 ACM Class: I.2

  16. arXiv:2108.02347  [pdf, other

    cs.LG cs.AI math.NA

    FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention

    Authors: Tan M. Nguyen, Vai Suliafu, Stanley J. Osher, Long Chen, Bao Wang

    Abstract: We propose FMMformers, a class of efficient and flexible transformers inspired by the celebrated fast multipole method (FMM) for accelerating interacting particle simulation. FMM decomposes particle-particle interaction into near-field and far-field components and then performs direct and coarse-grained computation, respectively. Similarly, FMMformers decompose the attention into near-field and fa… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: 18 pages, 8 figures

    MSC Class: 68T07 ACM Class: I.2

  17. arXiv:2104.12933  [pdf

    cond-mat.str-el cond-mat.mes-hall cond-mat.mtrl-sci

    Direct observation of 3D topological spin textures and their interactions using soft x-ray vector ptychography

    Authors: Arjun Rana, Chen-Ting Liao, Ezio Iacocca, Ji Zou, Minh Pham, Emma-Elizabeth Cating Subramanian, Yuan Hung Lo, Sinéad A. Ryan, Xingyuan Lu, Charles S. Bevis, Robert M. Karl Jr, Andrew J. Glaid, Young-Sang Yu, Pratibha Mahale, David A. Shapiro, Sadegh Yazdi, Thomas E. Mallouk, Stanley J. Osher, Henry C. Kapteyn, Vincent H. Crespi, John V. Badding, Yaroslav Tserkovnyak, Margaret M. Murnane, Jianwei Miao

    Abstract: Magnetic topological defects are energetically stable spin configurations characterized by symmetry breaking. Vortices and skyrmions are two well-known examples of 2D spin textures that have been actively studied for both fundamental interest and practical applications. However, experimental evidence of the 3D spin textures has been largely indirect or qualitative to date, due to the difficulty of… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

  18. arXiv:2007.03881  [pdf

    cond-mat.mtrl-sci

    Direct observation of 3D atomic packing in monatomic amorphous materials

    Authors: Yakun Yuan, Dennis S. Kim, Jihan Zhou, Dillan J. Chang, Fan Zhu, Yasutaka Nagaoka, Yao Yang, Minh Pham, Stanley J. Osher, Ou Chen, Peter Ercius, Andreas K. Schmid, Jianwei Miao

    Abstract: Liquids and solids are two fundamental states of matter. However, due to the lack of direct experimental determination, our understanding of the 3D atomic structure of liquids and amorphous solids remained speculative. Here we advance atomic electron tomography to determine for the first time the 3D atomic positions in monatomic amorphous materials, including a Ta thin film and two Pd nanoparticle… ▽ More

    Submitted 2 December, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

  19. arXiv:2006.06919  [pdf, other

    cs.LG math.DS stat.ML

    MomentumRNN: Integrating Momentum into Recurrent Neural Networks

    Authors: Tan M. Nguyen, Richard G. Baraniuk, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang

    Abstract: Designing deep neural networks is an art that often involves an expensive search over candidate architectures. To overcome this for recurrent neural nets (RNNs), we establish a connection between the hidden state dynamics in an RNN and gradient descent (GD). We then integrate momentum into this framework and propose a new family of RNNs, called {\em MomentumRNNs}. We theoretically prove and numeri… ▽ More

    Submitted 11 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 21 pages, 11 figures, Accepted for publication at Advances in Neural Information Processing Systems (NeurIPS) 2020

    MSC Class: 68T07 ACM Class: I.2

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 2020

  20. arXiv:2004.12210  [pdf, ps, other

    math.OC

    Computational methods for nonlocal mean field games with applications

    Authors: Siting Liu, Matthew Jacobs, Wuchen Li, Levon Nurbekyan, Stanley J. Osher

    Abstract: We introduce a novel framework to model and solve mean-field game systems with nonlocal interactions. Our approach relies on kernel-based representations of mean-field interactions and feature-space expansions in the spirit of kernel methods in machine learning. We demonstrate the flexibility of our approach by modeling various interaction scenarios between agents. Additionally, our method yields… ▽ More

    Submitted 28 April, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

    MSC Class: Primary: 35Q89; 49N80; 35A15; 65M70; 93A16 Secondary: 35Q91; 35Q93; 93A15

  21. arXiv:2003.00631  [pdf, other

    cs.LG cs.AI stat.ML

    Sparsity Meets Robustness: Channel Pruning for the Feynman-Kac Formalism Principled Robust Deep Neural Nets

    Authors: Thu Dinh, Bao Wang, Andrea L. Bertozzi, Stanley J. Osher

    Abstract: Deep neural nets (DNNs) compression is crucial for adaptation to mobile devices. Though many successful algorithms exist to compress naturally trained DNNs, developing efficient and stable compression algorithms for robustly trained DNNs remains widely open. In this paper, we focus on a co-design of efficient DNN compression algorithms and sparse neural architectures for robust and accurate deep l… ▽ More

    Submitted 1 March, 2020; originally announced March 2020.

    Comments: 16 pages, 7 figures

    MSC Class: 68T01

  22. arXiv:2002.10583  [pdf, other

    cs.LG cs.NE stat.ML

    Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent

    Authors: Bao Wang, Tan M. Nguyen, Andrea L. Bertozzi, Richard G. Baraniuk, Stanley J. Osher

    Abstract: Stochastic gradient descent (SGD) with constant momentum and its variants such as Adam are the optimization algorithms of choice for training deep neural networks (DNNs). Since DNN training is incredibly computationally expensive, there is great interest in speeding up the convergence. Nesterov accelerated gradient (NAG) improves the convergence rate of gradient descent (GD) for convex optimizatio… ▽ More

    Submitted 26 April, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: 35 pages, 16 figures, 18 tables

  23. arXiv:2002.10113  [pdf, other

    cs.LG cs.MA math.OC stat.ML

    Alternating the Population and Control Neural Networks to Solve High-Dimensional Stochastic Mean-Field Games

    Authors: Alex Tong Lin, Samy Wu Fung, Wuchen Li, Levon Nurbekyan, Stanley J. Osher

    Abstract: We present APAC-Net, an alternating population and agent control neural network for solving stochastic mean field games (MFGs). Our algorithm is geared toward high-dimensional instances of MFGs that are beyond reach with existing solution methods. We achieve this in two steps. First, we take advantage of the underlying variational primal-dual structure that MFGs exhibit and phrase it as a convex-c… ▽ More

    Submitted 14 July, 2023; v1 submitted 24 February, 2020; originally announced February 2020.

  24. arXiv:1907.06800  [pdf, other

    cs.LG math.NA stat.ML

    Graph Interpolating Activation Improves Both Natural and Robust Accuracies in Data-Efficient Deep Learning

    Authors: Bao Wang, Stanley J. Osher

    Abstract: Improving the accuracy and robustness of deep neural nets (DNNs) and adapting them to small training data are primary tasks in deep learning research. In this paper, we replace the output activation function of DNNs, typically the data-agnostic softmax function, with a graph Laplacian-based high dimensional interpolating function which, in the continuum limit, converges to the solution of a Laplac… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: 34 pages, 10 figures

    MSC Class: 68T01; 68T45

  25. arXiv:1906.12056  [pdf, other

    cs.LG cs.CR stat.ML

    DP-LSSGD: A Stochastic Optimization Method to Lift the Utility in Privacy-Preserving ERM

    Authors: Bao Wang, Quanquan Gu, March Boedihardjo, Farzin Barekat, Stanley J. Osher

    Abstract: Machine learning (ML) models trained by differentially private stochastic gradient descent (DP-SGD) have much lower utility than the non-private ones. To mitigate this degradation, we propose a DP Laplacian smoothing SGD (DP-LSSGD) to train ML models with differential privacy (DP) guarantees. At the core of DP-LSSGD is the Laplacian smoothing, which smooths out the Gaussian noise used in the Gauss… ▽ More

    Submitted 7 December, 2019; v1 submitted 28 June, 2019; originally announced June 2019.

    Comments: 21 pages, 7 figures

    MSC Class: 68T05

  26. arXiv:1901.06827  [pdf, other

    cs.LG math.DS math.NA stat.ML

    A Deterministic Gradient-Based Approach to Avoid Saddle Points

    Authors: Lisa Maria Kreusser, Stanley J. Osher, Bao Wang

    Abstract: Loss functions with a large number of saddle points are one of the major obstacles for training modern machine learning models efficiently. First-order methods such as gradient descent are usually the methods of choice for training machine learning models. However, these methods converge to saddle points for certain choices of initial guesses. In this paper, we propose a modification of the recent… ▽ More

    Submitted 28 September, 2020; v1 submitted 21 January, 2019; originally announced January 2019.

  27. arXiv:1811.10745  [pdf, other

    cs.LG cs.CR math.NA stat.ML

    ResNets Ensemble via the Feynman-Kac Formalism to Improve Natural and Robust Accuracies

    Authors: Bao Wang, Binjie Yuan, Zuoqiang Shi, Stanley J. Osher

    Abstract: Empirical adversarial risk minimization (EARM) is a widely used mathematical framework to robustly train deep neural nets (DNNs) that are resistant to adversarial attacks. However, both natural and robust accuracies, in classifying clean and adversarial images, respectively, of the trained robust models are far from satisfactory. In this work, we unify the theory of optimal control of transport eq… ▽ More

    Submitted 10 June, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 18 pages, 6 figures

    MSC Class: 68Txx

  28. arXiv:1811.06492  [pdf, other

    cs.LG cs.CR stat.ML

    Mathematical Analysis of Adversarial Attacks

    Authors: Zehao Dou, Stanley J. Osher, Bao Wang

    Abstract: In this paper, we analyze efficacy of the fast gradient sign method (FGSM) and the Carlini-Wagner's L2 (CW-L2) attack. We prove that, within a certain regime, the untargeted FGSM can fool any convolutional neural nets (CNNs) with ReLU activation; the targeted FGSM can mislead any CNNs with ReLU activation to classify any given image into any prescribed class. For a special two-layer neural network… ▽ More

    Submitted 25 November, 2018; v1 submitted 15 November, 2018; originally announced November 2018.

    Comments: 21 pages

  29. arXiv:1810.12993  [pdf, other

    math.NA

    Diagnosing Forward Operator Error Using Optimal Transport

    Authors: Michael A. Puthawala, Cory D. Hauck, Stanley J. Osher

    Abstract: We investigate overdetermined linear inverse problems for which the forward operator may not be given accurately. We introduce a new tool called the structure, based on the Wasserstein distance, and propose the use of this to diagnose and remedy forward operator error. Computing the structure turns out to use an easy calculation for a Euclidean homogeneous degree one distance, the Earth Mover's Di… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

  30. arXiv:1809.08622  [pdf, ps, other

    math.NA

    Error estimation of weighted nonlocal Laplacian on random point cloud

    Authors: Zuoqiang Shi, Bao Wang, Stanley J. Osher

    Abstract: We analyze the convergence of the weighted nonlocal Laplacian (WNLL) on high dimensional randomly distributed data. The analysis reveals the importance of the scaling weight $μ\sim P|/|S|$ with $|P|$ and $|S|$ be the number of entire and labeled data, respectively. The result gives a theoretical foundation of WNLL for high dimensional data interpolation.

    Submitted 23 September, 2018; originally announced September 2018.

    Comments: 15 pages; 2 figures

    MSC Class: 65-00

  31. arXiv:1809.08516  [pdf, other

    cs.LG math.NA stat.ML

    Adversarial Defense via Data Dependent Activation Function and Total Variation Minimization

    Authors: Bao Wang, Alex T. Lin, Wei Zhu, Penghang Yin, Andrea L. Bertozzi, Stanley J. Osher

    Abstract: We improve the robustness of Deep Neural Net (DNN) to adversarial attacks by using an interpolating function as the output activation. This data-dependent activation remarkably improves both the generalization and robustness of DNN. In the CIFAR10 benchmark, we raise the robust accuracy of the adversarially trained ResNet20 from $\sim 46\%$ to $\sim 69\%$ under the state-of-the-art Iterative Fast… ▽ More

    Submitted 29 April, 2020; v1 submitted 22 September, 2018; originally announced September 2018.

    Comments: 17 pages, 6 figures

    MSC Class: 68Pxx

    Journal ref: Inverse Problems and Imaging, 2020

  32. arXiv:1808.03228  [pdf, other

    math.NA

    Modeling Environmental Crime in Protected Areas Using the Level Set Method

    Authors: David J. Arnold, Dayne Fernandez, Ruizhe Jia, Christian Parkinson, Deborah Tonne, Yotam Yaniv, Andrea L. Bertozzi, Stanley J. Osher

    Abstract: National parks often serve as hotspots for environmental crime such as illegal deforestation and animal poaching. Previous attempts to model environmental crime were either discrete and network-based or required very restrictive assumptions on the geometry of the protected region and made heavy use of radial symmetry. We formulate a level set method to track criminals inside a protected region whi… ▽ More

    Submitted 6 August, 2018; originally announced August 2018.

    Comments: 18 pages, 12 figures

  33. arXiv:1802.00168  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Neural Nets with Interpolating Function as Output Activation

    Authors: Bao Wang, Xiyang Luo, Zhen Li, Wei Zhu, Zuoqiang Shi, Stanley J. Osher

    Abstract: We replace the output layer of deep neural nets, typically the softmax function, by a novel interpolating function. And we propose end-to-end training and testing algorithms for this new architecture. Compared to classical neural nets with softmax function as output activation, the surrogate with interpolating function as output activation combines advantages of both deep and manifold learning. Th… ▽ More

    Submitted 16 June, 2018; v1 submitted 1 February, 2018; originally announced February 2018.

    Comments: 11 pages, 4 figures

    MSC Class: 68Txx

  34. arXiv:1711.08833  [pdf, other

    cs.LG math.NA stat.ML

    Deep Learning for Real-Time Crime Forecasting and its Ternarization

    Authors: Bao Wang, Penghang Yin, Andrea L. Bertozzi, P. Jeffrey Brantingham, Stanley J. Osher, Jack Xin

    Abstract: Real-time crime forecasting is important. However, accurate prediction of when and where the next crime will happen is difficult. No known physical model provides a reasonable approximation to such a complex system. Historical crime data are sparse in both space and time and the signal of interests is weak. In this work, we first present a proper representation of crime data. We then adapt the spa… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

    Comments: 14 pages, 7 figures

    MSC Class: 62-07

  35. arXiv:1404.1370  [pdf, ps, other

    math.NA

    An L1 Penalty Method for General Obstacle Problems

    Authors: Giang Tran, Hayden Schaeffer, William M. Feldman, Stanley J. Osher

    Abstract: We construct an efficient numerical scheme for solving obstacle problems in divergence form. The numerical method is based on a reformulation of the obstacle in terms of an L1-like penalty on the variational problem. The reformulation is an exact regularizer in the sense that for large (but finite) penalty parameter, we recover the exact solution. Our formulation is applied to classical elliptic o… ▽ More

    Submitted 4 April, 2014; originally announced April 2014.

    Comments: 20 pages, 18 figures

  36. arXiv:1403.6883  [pdf, other

    cond-mat.mtrl-sci math-ph quant-ph

    Compressed Wannier modes found from an $L_1$ regularized energy functional

    Authors: Farzin Barekat, Ke Yin, Russel E. Caflisch, Stanley J. Osher, Rongjie Lai, Vidvuds Ozolins

    Abstract: We propose a method for calculating Wannier functions of periodic solids directly from a modified variational principle for the energy, subject to the requirement that the Wannier functions are orthogonal to all their translations ("shift-orthogonality"). Localization is achieved by adding an $L_1$ regularization term to the energy functional. This approach results in "compressed" Wannier modes wi… ▽ More

    Submitted 26 March, 2014; originally announced March 2014.

    Comments: 5 pages, 3 figures

  37. arXiv:1311.5850  [pdf, ps, other

    math.AP

    PDEs with Compressed Solutions

    Authors: Russel E. Caflisch, Stanley J. Osher, Hayden Schaeffer, Giang Tran

    Abstract: Sparsity plays a central role in recent developments in signal processing, linear algebra, statistics, optimization, and other fields. In these developments, sparsity is promoted through the addition of an $L^1$ norm (or related quantity) as a constraint or penalty in a variational principle. We apply this approach to partial differential equations that come from a variational quantity, either by… ▽ More

    Submitted 1 August, 2014; v1 submitted 22 November, 2013; originally announced November 2013.

    Comments: 21 pages, 15 figures

  38. arXiv:1207.6430  [pdf, other

    stat.ML cs.LG stat.AP

    Optimal Data Collection For Informative Rankings Expose Well-Connected Graphs

    Authors: Braxton Osting, Christoph Brune, Stanley J. Osher

    Abstract: Given a graph where vertices represent alternatives and arcs represent pairwise comparison data, the statistical ranking problem is to find a potential function, defined on the vertices, such that the gradient of the potential function agrees with the pairwise comparisons. Our goal in this paper is to develop a method for collecting data for which the least squares estimator for the ranking proble… ▽ More

    Submitted 4 June, 2014; v1 submitted 26 July, 2012; originally announced July 2012.

    Comments: 31 pages, 10 figures, 3 tables

    Report number: UCLA CAM report 12-32 MSC Class: 62F07; 05C40; 49N45;