Every millisecond matters. We’re open sourcing the tokenizer we built and deployed on production; th...
Perplexity 开源其高效的 Unigram 分词器,CPU 利用率降低 5-6 倍,显著减少延迟。
入选理由:Perplexity 开源 Unigram 分词器,CPU 利用率降低 5-6 倍。
产品
别名:Perplexity AI、Perplexity Health
一家提供人工智能驱动的搜索和数据分析服务的公司,其产品包括 Perplexity Health,用于整合和分析健康数据。
已跟踪 30 条高相关材料
最近变化
2026-06-02 · Perplexity 现在支持从 iPhone 的 Apple Health 导入睡眠、活动和 HRV 数据。
为什么值得关注
Perplexity 被反复提及时,通常意味着它正在影响产品路线、开发者工作流或 AI 产业判断。这个页面把分散材料合并成一个可持续更新的观察入口。
Every millisecond matters. We’re open sourcing the tokenizer we built and deployed on production; th...
Aravind Srinivas(@AravSrinivas) · 8.5 分
Perplexity 开源其高效的 Unigram 分词器,CPU 利用率降低 5-6 倍,显著减少延迟。
At production input lengths, the encoder cuts p50 latency by roughly 5× vs. HuggingFace tokenizers, ...
Perplexity(@perplexity_ai) · 8.5 分
Perplexity 的编码器在生产输入长度下将 p50 延迟降低了约 5 倍,相比 HuggingFace 分词器,2 倍相比 SentencePiece C++,1.5 倍相比 IREE C。
We're open-sourcing the Unigram tokenizer we rebuilt to reduce CPU utilization by 5-6x. Small reran...
Perplexity(@perplexity_ai) · 8.5 分
Perplexity开源了重构的Unigram分词器,CPU利用率降低5-6倍。
已收录 30 条与 Perplexity 相关的内容,按评分排序。
Perplexity 开源其高效的 Unigram 分词器,CPU 利用率降低 5-6 倍,显著减少延迟。
入选理由:Perplexity 开源 Unigram 分词器,CPU 利用率降低 5-6 倍。
Perplexity开源了重构的Unigram分词器,CPU利用率降低5-6倍。
入选理由:Perplexity开源了Unigram分词器,CPU利用率降低5-6倍。
Perplexity 的编码器在生产输入长度下将 p50 延迟降低了约 5 倍,相比 HuggingFace 分词器,2 倍相比 SentencePiece C++,1.5 倍相比 IREE C。
入选理由:Perplexity 编码器在生产输入长度下延迟降低约 5 倍
GB 200s improve the prefill and decode disaggregation efficiency for large MoE models like Qwen, significantly enhancing throughput compared to the Hopper platform.
入选理由:GB 200s 在高吞吐量推理方面比 Hopper 更适合大型 MoE 模型。
Perplexity published new research on deploying the Qwen3 235B model on NVIDIA GB200 NVL72 Blackwell racks, showing that GB200 outperforms Hopper in high-throughput inference for large MoE models.
入选理由:Qwen3 235B 模型在 NVIDIA GB200 上实现了高效的高吞吐量推理。
Perplexity is emerging as an enterprise-grade knowledge and research platform, with PayPal running 74,000 weekly tasks for model validation, market analysis, and strategic decision-making.
入选理由:PayPal 每周在 Perplexity Enterprise 上执行 74,000 个任务,覆盖核心研究流程。
Perplexity achieves default-secure computing architecture: each task runs in a hardware-isolated sandbox with VPC-level storage-compute separation and short-lived proxy tokens for agent authentication.
入选理由:所有任务运行在硬件级隔离的沙箱环境中,实现强安全隔离。
Lovable has rebuilt its apps with server-side rendering (SSR) to improve default discoverability in search engines and AI answer engines like Google, ChatGPT, and Perplexity, migrating its foundation to TanStack Start for better type safety and deployment flexibility.
入选理由:Lovable 应用通过服务端渲染(SSR)提升了在 Google、ChatGPT 和 Perplexity 中的默认可发现性。
Deep enterprise adoption of AI tools requires sustained investment in security engineering, with Perplexity implementing sandboxing and automated security workflows for risk control.
入选理由:企业部署AI工具需持续投资安全工程以保障可信运行
Perplexity open-sources its internal security tool Bumblebee to protect developer systems and enhance product safety.
入选理由:Bumblebee 最初是 Perplexity 内部开发的安全工具。
Computer now connects to Snowflake, enabling real-time analysis of enterprise data.
入选理由:Computer 可以连接 Snowflake 数据仓库进行实时分析。
Perplexity builds an AI agent skill system based on composability and evaluation, enabling automation of complex tasks by decomposing multi-step reasoning into atomic skill modules and continuously optimizing via real user behavior data.
入选理由:Perplexity将复杂任务拆解为12类原子化代理技能,如信息检索与验证。
Perplexity has published its internal guide for building AI agent skills, emphasizing a new mindset required for developers to design, refine, and maintain agent capabilities.
入选理由:Perplexity发布内部Agent技能手册,涵盖设计、精炼与维护全流程。
MiniMax Agent integrates Perplexity's search infrastructure, outperforming competitors in over 700 tasks with improved quality and efficiency.
入选理由:Perplexity 在 MiniMax 的 700+ 任务基准测试中胜出
Perplexity open-sources Bumblebee, a read-only scanner for macOS and Linux that detects risky packages, extensions, and AI configurations on developer machines.
入选理由:Bumblebee 是一个开源的只读安全扫描器,支持 macOS 和 Linux 平台。
Perplexity AI announces its system can now connect directly to the Snowflake data warehouse for end-to-end analytics.
入选理由:Perplexity 现在支持直接连接 Snowflake 数据仓库。
By filtering ads, navigation, and无用内容 before handoff to the answer model, Perplexity increases vital content per snippet by 63%, achieving a 50x compression ratio while maintaining frontier-level performance.
入选理由:通过过滤广告、导航、元数据和无效内容,Perplexity使关键信息密度提升63%
Perplexity is shifting search from a web-fetch tool call to code generation to adapt to a future where code execution inside agent harnesses dominates knowledge work.
入选理由:Perplexity 推出 Search as Code 架构,直接生成 Python 调用其搜索栈。
Perplexity introduces 'Search as Code' — an architecture enabling AI agents to generate Python code that directly calls our search stack, replacing manual function call loops.
入选理由:AI Agent 可直接生成 Python 调用 Perplexity 搜索栈,无需手动循环调用。
Lovable has restructured its apps to be server-side rendered, enhancing discoverability in search engines like Google and AI answer engines such as ChatGPT and Perplexity, built on TanStack Start due to its active maintenance and strong community.
入选理由:Lovable 应用通过服务端渲染(SSR)提升了搜索引擎和 AI 答案引擎的可发现性。
Perplexity is developing a highly secure and scalable agent runtime sandbox with key features including secure API key handling, content safety detection, and encrypted data transfer.
入选理由:Perplexity 使用代理方式管理 API 密钥以增强安全性。
Perplexity introduces dashboard and automation tools based on Snowflake data to support pipeline analysis, product usage tracking, and customer segmentation.
入选理由:Perplexity 支持从 Snowflake 构建数据看板和自动化流程。
PayPal uses Perplexity Enterprise to run 74,000 weekly tasks, used for model validation, channel performance, market trend research, competitive intelligence, and product analysis.
入选理由:PayPal 使用 Perplexity Enterprise 每周运行 74,000 任务。
New apps are now server-side rendered and discoverable through search engines like Google and AI answer engines like ChatGPT, Claude, and Perplexity. Existing apps get prerendering. Additionally, access @semrush search intelligence in-app. Chat with your SEO data, understand how you're performing, and improve discoverability. You can also run SEO reviews on demand, and fix all suggestions in one click.
入选理由:应用现在支持服务器端渲染并可搜索。
Perplexity scans external content in parallel using ML classifiers and the BrowseSafe model before AI agents act, with file data encrypted and auto-deleted after 7 days.
入选理由:外部内容通过 ML 分类器与 BrowseSafe 模型并行扫描以提升安全性。
Perplexity now connects to Apple Health on iPhone, allowing users to import sleep, activity, and HRV data, and query biomarkers, blood test results, or panel analyses.
入选理由:Perplexity 现在支持从 iPhone 的 Apple Health 导入睡眠、活动和 HRV 数据。
Google’s newly launched AI feature on its homepage offers fast response and smooth UX, prompting users to reconsider paying for Perplexity Pro. However, the article lacks technical validation or benchmarks.
入选理由:Google 新AI模式响应迅速,用户体验优于传统搜索界面
The tweet humorously references the 'Zen of Python' and extends it with 'Zen of Skills', suggesting easily explainable ideas may already be known by models and should be deleted.
入选理由:‘如果实现容易解释,可能是好主意’源自Python之禅
Several well-known tech companies have their employees sharing project progress on Twitter, a practice that more companies should emulate.
入选理由:Anthropic、OpenAI、Shopify、Stripe、Notion、Cursor、xAI、Perplexity 和 Figma 的员工在推特上分享他们的项目进展。
Perplexity 提供了关于改进 Unigram 分词器 CPU 性能的博客文章链接。
入选理由:Perplexity 提供了改进 Unigram 分词器性能的博客文章。