Review on Elastic On-Device LLM Service
Motivation Different LLM requests call for different service levels, measured by metrics such as time-to-first-token (TTFT) and time-per-output-token (TPOT). For instance, voice assistants require low TTFT to pr...
Feed-Forward Network A feed-forward network is a network whose connections form no cycles, so its output is never fed back as part of its input. Multi-Layer Perceptron A multi-layer perceptron (MLP) is a type of feed-forward neural netwo...
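As a rough illustration of the feed-forward idea, here is a minimal sketch of an MLP forward pass in plain Python. The helper names (`dense`, `relu`, `mlp_forward`), the ReLU activation, and the weights are assumptions chosen for the example, not taken from the post. Data only flows from one layer to the next; nothing is fed back.

```python
def dense(inputs, weights, biases):
    """One fully connected layer: each output is a weighted sum plus a bias."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, biases)
    ]


def relu(values):
    """Element-wise ReLU activation (an assumed choice of non-linearity)."""
    return [max(0.0, v) for v in values]


def mlp_forward(x, layers):
    """Run x through each (weights, biases) layer in order, with no feedback."""
    h = x
    for i, (weights, biases) in enumerate(layers):
        h = dense(h, weights, biases)
        if i < len(layers) - 1:  # no activation after the output layer
            h = relu(h)
    return h


# Hypothetical 2-2-1 network with hand-picked weights.
layers = [
    ([[1.0, -1.0], [0.5, 0.5]], [0.0, 0.0]),  # hidden layer
    ([[1.0, 1.0]], [0.0]),                    # output layer
]
output = mlp_forward([2.0, 1.0], layers)
```

The loop visits each layer exactly once, which is what separates a feed-forward network from a recurrent one.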
UML (Unified Modeling Language) is a language used to model software systems. It is a graphical language that allows developers to visualize the structure and behavior of a software system. In t...
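Loop device A loop device is a virtual block device in Linux that presents a regular file as if it were a disk device. Why is it called a loop device? The usual read/write path is: application → file system → block device (/dev/sda) → disk. With a loop device, the path is: application → file system → /dev/loop0 → kernel → regular file → file system → disk. A file write loops back into another file write instead of going straight to the disk.
Takeaways Release locks as early as possible! Each lock should have a clearly defined purpose.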
I had trouble implementing BusTub P1 correctly. It’s all about move semantics and resource ownership. I found the “copy-and-swap idiom” very helpful in solving this problem. Here’s a brief explan...
An LL grammar is a kind of top-down grammar. Top-down parsing has two actions: match and derive (expand). Starting from the start symbol, it derives the given string. LL grammar Definition: see https://en.wikipedia.org/wiki/LL_grammar. For \(\text{LL}(k)...
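It’s a platform-independent language that abstracts above object code. Intermediate code Stack-based intermediate code: postfix notation. Quadruples: (op, arg1, arg2, result) [!INFO] If you are a student at Jiangxi Normal University, you also need to know the following notes: How to write input and output: ...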
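To make the two intermediate forms concrete, here is a small sketch that turns a postfix expression into (op, arg1, arg2, result) quadruples using a single stack. The function name `postfix_to_quadruples`, the temporary names `t1`, `t2`, ..., and the restriction to binary arithmetic operators are assumptions made for this example.

```python
def postfix_to_quadruples(tokens):
    """Translate postfix tokens into (op, arg1, arg2, result) quadruples."""
    operators = {"+", "-", "*", "/"}
    stack = []  # operands (names or temporaries) waiting for an operator
    quads = []  # emitted quadruples, in execution order
    for tok in tokens:
        if tok in operators:
            if len(stack) < 2:
                raise ValueError("operator is missing an operand")
            right = stack.pop()  # the second operand was pushed last
            left = stack.pop()
            temp = f"t{len(quads) + 1}"  # hypothetical temporary naming scheme
            quads.append((tok, left, right, temp))
            stack.append(temp)  # the result becomes an operand for later ops
        else:
            stack.append(tok)
    if len(stack) != 1:
        raise ValueError("malformed postfix expression")
    return quads


# a + b * c is written "a b c * +" in postfix.
quads = postfix_to_quadruples(["a", "b", "c", "*", "+"])
for quad in quads:
    print(quad)
```

Each operator consumes the two most recent operands and leaves a temporary in their place, so the order of the quadruples matches the evaluation order of the postfix string.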
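Workflow Build the conceptual model, then convert the ER diagram into a relational model. If the ER diagram is too large, first split it into sub-diagrams and then merge them. Building the conceptual model: draw the ER diagram. Rectangles represent entities, diamonds represent relationships, ellipses represent attributes. Converting the ER diagram into a relational model Model conversion: entity -> table, relationship -> relation. Work through P239 F7, 8, 9 in the textbook and mark the table names, attributes, and primary keys (underlined). Form: TableName(...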
Recently, I ran into an issue with the NVIDIA driver after an unexpected Windows update. The update reordered the BIOS boot entries, and when I subsequently booted into Fedora 43, the NVIDIA kernel...