Search Results for author: Denis Kuznedelev

Found 12 papers, 6 papers with code

Does Diffusion Beat GAN in Image Super Resolution?

no code implementations • 27 May 2024 • Denis Kuznedelev, Valerii Startsev, Daniil Shlenskii, Sergey Kastryulin

There is a prevalent opinion in the recent literature that Diffusion-based models outperform GAN-based counterparts on the Image Super Resolution (ISR) problem.

PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression

1 code implementation23 May 2024 Vladimir Malinovskii, Denis Mazur, Ivan Ilin, Denis Kuznedelev, Konstantin Burlachenko, Kai Yi, Dan Alistarh, Peter Richtarik

In this work, we question the use of STE for extreme LLM compression, showing that it can be sub-optimal, and perform a systematic study of quantization-aware fine-tuning strategies for LLMs.

Quantization
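
The STE referenced above is the standard trick of quantizing weights in the forward pass while treating the rounding step as the identity during backpropagation. Below is a minimal, generic PyTorch sketch of STE-based weight quantization, an illustration of the concept only and not the PV-Tuning method; the 4-bit uniform scheme and per-tensor scale are illustrative assumptions.

```python
# Generic straight-through estimator (STE) for weight quantization.
# Illustrative sketch only -- not the PV-Tuning algorithm.
import torch

def quantize_ste(w: torch.Tensor, num_bits: int = 4) -> torch.Tensor:
    """Uniformly quantize w; gradients pass through the rounding unchanged."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = w.abs().max() / qmax                      # per-tensor scale (assumed)
    w_q = torch.clamp(torch.round(w / scale), -qmax - 1, qmax) * scale
    # Straight-through trick: forward uses w_q, backward sees the identity.
    return w + (w_q - w).detach()

# Usage: quantize on the fly inside a training step.
weight = torch.randn(16, 16, requires_grad=True)
x = torch.randn(8, 16)
loss = (x @ quantize_ste(weight)).pow(2).mean()
loss.backward()                                       # weight.grad is defined
```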

Extreme Compression of Large Language Models via Additive Quantization

1 code implementation • 11 Jan 2024 • Vage Egiazarian, Andrei Panferov, Denis Kuznedelev, Elias Frantar, Artem Babenko, Dan Alistarh

The emergence of accurate open large language models (LLMs) has led to a race towards quantization techniques that enable their execution on end-user devices.

Quantization
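
For intuition, additive quantization (the general family this work builds on) represents each group of weights as a sum of codewords, one from each of several learned codebooks, so only small integer codes need to be stored. The NumPy sketch below decodes such a representation; the codebook sizes are hypothetical and do not reflect the paper's actual configuration.

```python
# Generic additive-quantization decoding: each weight group is reconstructed
# as the sum of one codeword per codebook. Sizes are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
M, K, d = 2, 256, 8                  # codebooks, codewords per book, group size
codebooks = rng.standard_normal((M, K, d)).astype(np.float32)

num_groups = 4
codes = rng.integers(0, K, size=(num_groups, M))   # M one-byte indices per group

# Decode: sum the selected codeword from every codebook.
decoded = np.stack([
    sum(codebooks[m, codes[g, m]] for m in range(M))
    for g in range(num_groups)
])
print(decoded.shape)                 # (4, 8) reconstructed weight groups
```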

Sparse Fine-tuning for Inference Acceleration of Large Language Models

2 code implementations • 10 Oct 2023 • Eldar Kurtic, Denis Kuznedelev, Elias Frantar, Michael Goin, Dan Alistarh

While the standard approach is to leverage sparsity for computational reduction, we observe that in the case of memory-bound LLMs, sparsity can also be leveraged for reducing memory bandwidth.

Quantization • Text Generation • +1
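
To make the memory-bandwidth point above concrete, a highly sparse weight matrix stored in a compressed format simply has fewer bytes to move from memory, which is what matters when inference is memory-bound. The sketch below compares dense and CSR storage for a hypothetical 90%-sparse layer; the sizes are illustrative and unrelated to the paper's models.

```python
# Compare bytes needed for a dense layer vs. its ~90%-sparse CSR version.
# Illustrative sizes only; fewer bytes to read => less memory bandwidth used.
import numpy as np
from scipy.sparse import csr_matrix

rng = np.random.default_rng(0)
dense = rng.standard_normal((4096, 4096)).astype(np.float32)
dense[rng.random(dense.shape) < 0.9] = 0.0          # zero out ~90% of weights

sparse = csr_matrix(dense)
sparse_bytes = sparse.data.nbytes + sparse.indices.nbytes + sparse.indptr.nbytes
print(f"dense: {dense.nbytes / 1e6:.1f} MB, CSR: {sparse_bytes / 1e6:.1f} MB")
```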

Accurate Neural Network Pruning Requires Rethinking Sparse Optimization

no code implementations • 3 Aug 2023 • Denis Kuznedelev, Eldar Kurtic, Eugenia Iofinova, Elias Frantar, Alexandra Peste, Dan Alistarh

Obtaining versions of deep neural networks that are both highly accurate and highly sparse is one of the main challenges in the area of model compression, and several high-performance pruning techniques have been investigated by the community.

Model Compression • Network Pruning • +1

Vision Models Can Be Efficiently Specialized via Few-Shot Task-Aware Compression

no code implementations • 25 Mar 2023 • Denis Kuznedelev, Soroush Tabesh, Kimia Noorbakhsh, Elias Frantar, Sara Beery, Eldar Kurtic, Dan Alistarh

To address this, we ask: can we quickly compress large generalist models into accurate and efficient specialists?

A critical look at the evaluation of GNNs under heterophily: Are we really making progress?

2 code implementations • 22 Feb 2023 • Oleg Platonov, Denis Kuznedelev, Michael Diskin, Artem Babenko, Liudmila Prokhorenkova

Graphs without this property (homophily, i.e., edges predominantly connecting nodes of the same class) are called heterophilous, and it is typically assumed that specialized methods are required to achieve strong performance on such graphs.

Graph Representation Learning • Node Classification

CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models

no code implementations • NeurIPS 2023 • Denis Kuznedelev, Eldar Kurtic, Elias Frantar, Dan Alistarh

To further showcase CAP's accuracy and scalability, we use it to show for the first time that extremely accurate large vision models, trained via self-supervised techniques, can also be pruned to moderate sparsities with negligible accuracy loss.

Image Classification • Quantization
