Architecture

What is Attention Mechanism?

The core technique that allows transformers to focus on relevant parts of the input.

Definition

The attention mechanism allows a neural network to weigh the importance of different parts of the input when processing each element. In transformers, "self-attention" lets every token in a sequence attend to every other token, capturing long-range dependencies. Multi-head attention runs several attention computations in parallel for richer representations.
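The definition above can be sketched in a few lines of numpy. This is a minimal, illustrative implementation of single-head scaled dot-product self-attention; the projection matrices `Wq`, `Wk`, `Wv` and the toy dimensions are arbitrary choices for the sketch, not values from any particular model.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax: subtract the max before exponentiating.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    # Project each token embedding into a query, key, and value vector.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d_k = Q.shape[-1]
    # Compare every token's query against every token's key,
    # scaled by sqrt(d_k) to keep the dot products well behaved.
    scores = Q @ K.T / np.sqrt(d_k)
    # Each row is one token's attention distribution over all tokens.
    weights = softmax(scores, axis=-1)
    # Each token's output is a weighted mix of all value vectors.
    return weights @ V, weights

# Toy input: 5 tokens, 8-dimensional embeddings (illustrative sizes).
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 8))
Wq = rng.normal(size=(8, 4))
Wk = rng.normal(size=(8, 4))
Wv = rng.normal(size=(8, 4))

out, weights = self_attention(X, Wq, Wk, Wv)
print(out.shape)             # (5, 4): one output vector per token
print(weights.sum(axis=-1))  # each row sums to 1
```

Multi-head attention simply runs several such computations in parallel with different projection matrices and concatenates the results.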

💡 Example

In the sentence "The cat sat on the mat because it was tired," attention helps the model understand that "it" refers to "the cat" by assigning high attention weights between those tokens, even though they are far apart.

Related concepts

LLM (Large Language Model)

A type of AI trained on massive text datasets to understand and generate human language.

Token

The basic unit of text that AI models process, roughly 4 characters or 0.75 words.

Transformer

The neural network architecture that powers modern AI language models.

