Highly scored unanswered questions - Artificial Intelligence Stack Exchange

16 votes

1 answer

388 views

Will parameter sweeping on one split of data followed by cross validation discover the right hyperparameters?

Let's call our dataset splits train/test/evaluate. We're in a situation where we require months of data. So we prefer to use the evaluation dataset as infrequently as possible to avoid polluting our ...

CommunityBot

1

modified Apr 15 at 8:06

11 votes

0 answers

413 views

Extending FaceNet’s triplet loss to object recognition

FaceNet uses a novel loss metric (triplet loss) to train a model to output embeddings (128-D from the paper), such that any two faces of the same identity will have a small Euclidean distance, and ...

CommunityBot

1

modified Mar 16, 2022 at 17:08

10 votes

1 answer

929 views

Are Cellular Neural Networks one type of Neural Networks?

I am researching Cellular Neural Networks and have already read Chua's two articles (1988). In cellular neural networks, a cell is only in relation with its neighbors. So it is easy to use them for ...

CommunityBot

1

modified May 28 at 0:05

8 votes

2 answers

910 views

Are there transformer-based architectures that can produce fixed-length vector encodings given arbitrary-length text documents?

BERT encodes a piece of text such that each token (usually words) in the input text map to a vector in the encoding of the text. However, this makes the length of the encoding vary as a function of ...

CommunityBot

1

modified Apr 3 at 4:02

8 votes

0 answers

282 views

Is the Bellman equation that uses sampling weighted by the Q values (instead of max) a contraction?

It is proved that the Bellman update is a contraction (1). Here is the Bellman update that is used for Q-Learning: $$Q_{t+1}(s, a) = Q_{t}(s, a) + \alpha*(r(s, a, s') + \gamma \max_{a^*} (Q_{t}(s', ...

CommunityBot

1

modified Jul 25, 2020 at 8:28

8 votes

2 answers

453 views

How can I solve the zero subset sum problem with hill climbing?

I want to solve the zero subset sum problem with the hill-climbing algorithm, but I am not sure I found a good state space for this. Here is the problem: consider we have a set of numbers and we want ...

CommunityBot

1

modified Jun 3 at 18:01

8 votes

0 answers

140 views

Normalizing Normal Distributions in Thompson Sampling for online Reinforcement Learning

In my implementation of Thompson Sampling (TS) for online Reinforcement Learning, my distribution for selecting $a$ is $\mathcal{N}(Q(s, a), \frac{1}{C(s,a)+1})$, where $C(s,a)$ is the number of times ...

nbro

41k

modified Dec 20, 2021 at 14:51

8 votes

1 answer

186 views

What is the impact of using multiple BMUs for self-organizing maps?

Here's a sort of a conceptual question. I was implementing a SOM algorithm to better understand its variations and parameters. I got curious about one bit: the BMU (best matching unit == the neuron ...

CommunityBot

1

modified May 28 at 14:04

8 votes

0 answers

125 views

What are the current trends/open questions in logics for knowledge representation?

What are the future prospects in near future from a theoretical investigation of description logics, and modal logics in the context of artificial intelligence research?

CommunityBot

1

modified Apr 28, 2019 at 18:02

8 votes

0 answers

1k views

Is there a difference in the architecture of deep reinforcement learning when multiple actions are performed instead of a single action?

I've built a deep deterministic policy gradient reinforcement learning agent to be able to handle any games/tasks that have only one action. However, the agent seems to fail horribly when there are ...

nbro

41k

modified Mar 28, 2021 at 1:32

8 votes

1 answer

4k views

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders?

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders? I cannot find any resources for that. Is it safe to assume that, since it works for other DNNs, it will also ...

CommunityBot

1

modified Apr 12 at 18:04

7 votes

2 answers

163 views

Can training a model on a dataset composed by real images and drawings hurt the training process of a real-world application model?

I'm training a multi-label classifier that's supposed to be tested on underwater images. I'm wondering if feeding the model drawings of a certain class plus real images can affect the results badly. ...

CommunityBot

1

modified Apr 27 at 10:02

7 votes

0 answers

1k views

How is the rollout from the MCTS implemented in both of the AlphaGo Zero and the AlphaZero algorithms?

In the vanilla Monte Carlo tree search (MCTS) implementation, the rollout is usually implemented following a uniform random policy, that is, it takes random actions until the game is finished and only ...

nbro

41k

modified Dec 19, 2021 at 18:14

7 votes

1 answer

154 views

How does the network know which objects to track in the paper "Label-Free Supervision of Neural Networks with Physics and Domain Knowledge"?

I was reading the paper Label-Free Supervision of Neural Networks with Physics and Domain Knowledge, published at AAAI 2017, which won the best paper award. I understand the math and it makes sense. ...

CommunityBot

1

modified Jun 1 at 8:03

7 votes

1 answer

214 views

How could an AI detect whether an enemy in a game can be blocked off/trapped?

Imagine a game played on a 10x10 grid system where a player can move up down left or right and imagine there are two players on this grid: An enemy and you. In this game, there are walls on the grid ...

CommunityBot

1

modified Jul 29, 2023 at 19:00

Stack Exchange Network

Unanswered Questions

Will parameter sweeping on one split of data followed by cross validation discover the right hyperparameters?

Extending FaceNet’s triplet loss to object recognition

Are Cellular Neural Networks one type of Neural Networks?

Are there transformer-based architectures that can produce fixed-length vector encodings given arbitrary-length text documents?

Is the Bellman equation that uses sampling weighted by the Q values (instead of max) a contraction?

How can I solve the zero subset sum problem with hill climbing?

Normalizing Normal Distributions in Thompson Sampling for online Reinforcement Learning

What is the impact of using multiple BMUs for self-organizing maps?

What are the current trends/open questions in logics for knowledge representation?

Is there a difference in the architecture of deep reinforcement learning when multiple actions are performed instead of a single action?

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders?

Can training a model on a dataset composed by real images and drawings hurt the training process of a real-world application model?

How is the rollout from the MCTS implemented in both of the AlphaGo Zero and the AlphaZero algorithms?

How does the network know which objects to track in the paper "Label-Free Supervision of Neural Networks with Physics and Domain Knowledge"?

How could an AI detect whether an enemy in a game can be blocked off/trapped?

Unanswered Questions

Unanswered Tags