来自 JPMorgan 的 DocLLM：一种面向布局的生成式语言模型，能理解多模态文档对于企业文档来说，不仅仅是文本类型，还有很多复杂的类型，例如表格、发票、收据、报告、合同等，其中都包含着丰富的文字和空间交互信息。这些文档复杂的布局提供了视觉线索，对于有效理解这些文档至关重要。…

发布时间: 2024-01-03 13:00:35

1分

数据加载中

来自 JPMorgan 的 DocLLM：一种面向布局的生成式语言模型，能理解多模态文档
对于企业文档来说，不仅仅是文本类型，还有很多复杂的类型，例如表格、发票、收据、报告、合同等，其中都包含着丰富的文字和空间交互信息。这些文档复杂的布局提供了视觉线索，对于有效理解这些文档至关重要。…
IT技术
( twitter.com )

来自 JPMorgan 的 DocLLM：一种面向布局的生成式语言模型，能理解多模态文档

对于企业文档来说，不仅仅是文本类型，还有很多复杂的类型，例如表格、发票、收据、报告、合同等，其中都包含着丰富的文字和空间交互信息。这些文档复杂的布局提供了视觉线索，对于有效理解这些文档至关重要。

本论文以此建议了一种轻量级扩展的大语言模型（LLMs） - DocLLM，这款模型可在处理可视文档时，同时考虑到文本语义和空间布局。该模型与现有的多模态语言模型（LLMs）的最大不同在于，它没有使用计算成本高昂的图像编码器，而是通过边框信息来整合空间布局。

具体来说，DocLLM 通过将文本和空间模态之间的交叉对齐分解为一组独立矩阵来处理既定的 Transformer 的注意力机制。

此外，DocLLM 还设计了一个预训练目标，学习如何自动填充文本段落。这种方式使其能更好地处理常见的视觉文档中的不规则布局和混合内容。

DocLLM 使用大型指令数据集对预训练模型进行了微调，覆盖了四个主要的文档智能任务。

DocLLM 的解决方案在所有任务的16个数据集中的14个上优于现有的最先进语言模型，且在之前未曾接触过的5个数据集中的4个上有良好的应用表现。

论文地址：https://t.co/FVPvQPQsR0

点击图片查看原图

Markdown支持

评论加载中...

您可能感兴趣的：更多

BREAKING: JPMorgan says it no longer expects a recession in the United States
时政
( twitter.com)

2年前 • The Spectator Index • -- 点击 0 评论

JUST IN: JPMorgan Chase processed over $1 billion for Jeffrey Epstein over the course of 16 years
时政
( twitter.com)

2年前 • The Spectator Index • -- 点击 0 评论

JUST IN: US Virgin Islands lawyer says JPMorgan Chase processed over $1 billion for Jeffrey Epstein over the course of 16 years
时政
( twitter.com)

2年前 • The Spectator Index • -- 点击 0 评论

Chief Executive salary in 2023.
Morgan Stanley (James Gorman): $37 million
JPMorgan (Jamie Dimon): $36 million
Goldman Sachs (David Solomon): $31 million
Bank of America (Brian Moynihan): $29 million
时政
( twitter.com)

1年前 • The Spectator Index • -- 点击 0 评论

TECH: JPMorgan CEO Jamie Dimon says artificial intelligence could be as 'extraordinary and possibly as transformational' as major inventions of the past several hundred years, including the 'printing press, the steam engine, electricity, computing and the Internet'.
时政
( twitter.com)

1年前 • The Spectator Index • -- 点击 0 评论

🇺🇸 Stock performance, past year.
NVIDIA: +205%
Meta: +138%
Broadcom: +111%
Eli Lilly: +104%
Amazon: +80%
JPMorgan: +53%
Microsoft: +46%
Alphabet: +46%
Berkshire Hathaway: +32%
Mastercard: +30%
Visa: +21%
Walmart: +18%
ExxonMobil: +5%
Apple: +4%
Procter and Gamble: +3%
Tesla:…
时政
( twitter.com)

1年前 • The Spectator Index • -- 点击 0 评论

Share price, today.
Tesla: +15.1%
Apple: +2.4%
Exxon Mobil: +1.3%
Amazon: +0.9%
Home Depot: +0.5%
Eli Lilly: +0.5%
Nvidia: +0.4%
J&J: +0.4%
P&G: +0.2%
Walmart: +0.1%
Broadcom: -0.3%
JPMorgan: -0.3%
Berkshire Hathaway: -0.4%
Visa: -0.8%
Merck: -0.9%
Mastercard: -1%
Microsoft:…
时政
( twitter.com)

1年前 • The Spectator Index • -- 点击 0 评论

老大读一文一理双学位还有四门选修课，年底就毕业了，5月绿卡出来了，找工作恰逢其时。今天接到jpmorgan与德勤的面试通知，当然这些大企业，面试会有很多轮，看来今年工作很好找，老父亲甚感欣慰，养孩子的幸福感油然而生。
时政
( twitter.com)

2年前 • 薛船长在LA • -- 点击 0 评论

作为全球最大银行JPMorgan的CEO，杰米·戴蒙Jamie Dimon在国际经济发展/金融政策以及美国政商两届都扮演重要角色，他说话是很有分量的。戴蒙关于俄乌战争，中东冲突，美中关系，以及即将到来的美国大选的观点，是值得认真了解的。
时政
( twitter.com)

1年前 • 艾森 Essen • -- 点击 • 下载视频 0 评论

00:11:47

IT技术

BREAKING: JPMorgan says it no longer expects a recession in the United States 时政 ( twitter.com)

时政

JUST IN: JPMorgan Chase processed over $1 billion for Jeffrey Epstein over the course of 16 years 时政 ( twitter.com)

时政

JUST IN: US Virgin Islands lawyer says JPMorgan Chase processed over $1 billion for Jeffrey Epstein over the course of 16 years 时政 ( twitter.com)

时政

Chief Executive salary in 2023. Morgan Stanley (James Gorman): $37 million JPMorgan (Jamie Dimon): $36 million Goldman Sachs (David Solomon): $31 million Bank of America (Brian Moynihan): $29 million 时政 ( twitter.com)

时政

TECH: JPMorgan CEO Jamie Dimon says artificial intelligence could be as 'extraordinary and possibly as transformational' as major inventions of the past several hundred years, including the 'printing press, the steam engine, electricity, computing and the Internet'. 时政 ( twitter.com)

时政

时政

时政

时政

时政

创建一个新帐户

登录

BREAKING: JPMorgan says it no longer expects a recession in the United States
时政
( twitter.com)

JUST IN: JPMorgan Chase processed over $1 billion for Jeffrey Epstein over the course of 16 years
时政
( twitter.com)

JUST IN: US Virgin Islands lawyer says JPMorgan Chase processed over $1 billion for Jeffrey Epstein over the course of 16 years
时政
( twitter.com)

Chief Executive salary in 2023.
Morgan Stanley (James Gorman): $37 million
JPMorgan (Jamie Dimon): $36 million
Goldman Sachs (David Solomon): $31 million
Bank of America (Brian Moynihan): $29 million
时政
( twitter.com)

TECH: JPMorgan CEO Jamie Dimon says artificial intelligence could be as 'extraordinary and possibly as transformational' as major inventions of the past several hundred years, including the 'printing press, the steam engine, electricity, computing and the Internet'.
时政
( twitter.com)