Wink - AI原生创新，忠于用户，专属智能体验

Discover Amazing Content, Share Life Moments

Connect Our Wonderful World

Quantifying Model Evaluation: Creating a Benchmark for Different Precision Models

Inspired by community discussions, I plan to establish a benchmark for quantized models to evaluate the relationship between precision loss and VRAM/performance gains, covering areas such as programming, mathematics, translation, and general knowledge.

2025-10-22 21:04:42Read More

Using Ollama to Write Papers? Don't Expect One-Click Generation

When trying to write long papers with AI tools, you often encounter content repetition or mid-process interruptions. Adjusting context length and building content frameworks step-by-step might be more effective, but don't expect it to completely replace your own thinking and revision process.

2025-10-22 21:02:40Read More

DeepSeek-OCR Tested: Finding the Sweet Spot Between Compression Ratio and Accuracy

Testing reveals DeepSeek-OCR maintains 97% accuracy at 10x compression with significant visual token compression, but accuracy drops sharply beyond 12x compression.

2025-10-22 20:04:57Read More

7 Cognitive Superpowers of Superintelligence

In 'Superintelligence,' Nick Bostrom outlines seven cognitive capabilities that superintelligent AI might possess that far exceed human capacities, from strategic planning to social manipulation, each one thought-provoking in its implications.

2025-10-22 20:02:46Read More

Perplexity's 42-Page Work Guide and a Little Online Farce

A so-called internal AI work guide from Perplexity gained attention on X but was quickly revealed to be just publicly available product promotional material. The incident itself is minor, but the underlying interaction patterns are worth examining.

2025-10-22 19:31:20Read More

1.5x Faster Inference for LoRA Fine-Tuned Models: How We Solved PEFT Service Performance Bottlenecks

The Databricks team achieved a 1.5x increase in inference throughput for LoRA fine-tuned models while maintaining model quality by customizing the inference engine. The key lies in innovations like quantization strategies, kernel overlap, and streamed multi-processor partitioning.

2025-10-22 19:31:20Read More

Our Human-AI Collaborative Translation Process: Let AI Do the Heavy Lifting, Humans Add the Soul

Sharing a translation workflow that has been tested in practice, through fine-grained division of labor between AI and humans, significantly improving efficiency while ensuring quality.

2025-10-22 19:31:20Read More

OCR Model Evaluation: When PaddleOCR-VL Faces Off Against MinerU2.5, MonkeyOCR, and GPT-4o

We compared PaddleOCR-VL with MinerU2.5, MonkeyOCR, and GPT-4o in terms of their performance on complex layout documents. The results show that PaddleOCR-VL performs robustly in layout detection accuracy and reading order consistency, while other models exhibited issues such as element omissions, layout misjudgments, and even content hallucinations.

2025-10-22 19:31:20Read More

Lovart AI: An AI Design Tool Integrating Image, Video, and 3D Generation

Lovart AI, as the first AI design assistant equipped with Seedream 4.0 and Nano Banana, combines mainstream image, video, and 3D model generation capabilities, supporting image editing, music generation, and speech synthesis. This article demonstrates its applications in 3D model creation, interior design, and e-commerce content generation through practical examples.

2025-10-22 19:31:20Read More

Open Source as Another Form of Talent Monopoly

Discussed with Professor Yao Dong a perspective: open source is essentially a form of talent monopoly. General compression algorithms are open source, but top researchers are concentrated in a few countries like China and the U.S., making it hard for others to catch up. The field of large models is repeating this pattern.

2025-10-22 19:31:20Read More