All Questions
Tagged with machine-learning nlp
753 questions
0 votes · 0 answers · 11 views
NER with custom tags and no training data, zero-shot approach help
I am building a "field tagger" for documents. Basically, a document, in my case something like a proposal or sales quote, would have a bunch of entities scattered throughout it, and we want ...
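A minimal sketch of one zero-shot approach, assuming the Hugging Face zero-shot-classification pipeline; the field labels and the sample span below are hypothetical, not taken from the question:
```python
# Zero-shot field tagging sketch: score a candidate span against custom field labels.
from transformers import pipeline

classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

candidate_labels = ["client name", "quote amount", "delivery date", "product"]  # hypothetical tags
span = "Total price: $42,000, valid until March 31"                             # hypothetical span

result = classifier(span, candidate_labels)
print(result["labels"][0], result["scores"][0])  # best-matching field tag and its score
```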
0 votes · 0 answers · 20 views
NLP: how to handle bad tokenization
I get nonsense when trying to translate the following German sentence to Swedish using google/madlad400-3b-mt:
a. Natürliche Personen: BundID mit ELSTER-Zertifikat oder nPA/eID/eAT-Authentifizierung (natural persons: BundID with ELSTER certificate or nPA/eID/eAT authentication)
...
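Garbage output with this model often comes down to the prompt format rather than tokenization itself. A minimal sketch, assuming the `<2xx>` target-language prefix convention used by MADLAD-400 checkpoints (`<2sv>` for Swedish) and the stock T5 tokenizer:
```python
# Minimal MADLAD-400 translation sketch; "<2sv>" selects Swedish as the target language.
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("google/madlad400-3b-mt")
model = T5ForConditionalGeneration.from_pretrained("google/madlad400-3b-mt")

text = "<2sv> Natürliche Personen: BundID mit ELSTER-Zertifikat oder nPA/eID/eAT-Authentifizierung"
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```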
0 votes · 0 answers · 16 views
Latest tree-based models
What are the latest tree-based models used in machine learning, beyond established ones such as decision trees, Random Forest, Gradient Boosting, LightGBM, XGBoost, and ...
1 vote · 1 answer · 62 views
Improving GPU Utilization in LLM Inference System
I'm trying to build a distributed LLM inference platform with Hugging Face support. The implementation uses Python for model processing and Java for interfacing with external systems. ...
0 votes · 0 answers · 4 views
Leveraging Extra Data to Enhance Text Clustering
I want to cluster thousands of text documents (called corpus A) and find a label for each cluster. Clustering accuracy is critically important, because I want to use the texts and their labels for ...
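A minimal clustering sketch, assuming sentence-transformers embeddings and k-means; the model name, documents, and cluster count are placeholders:
```python
# Embed corpus A and cluster it; model name and n_clusters are illustrative only.
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans

corpus_a = ["text one ...", "text two ...", "text three ..."]  # placeholder documents

embedder = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = embedder.encode(corpus_a)

kmeans = KMeans(n_clusters=2, random_state=0, n_init="auto").fit(embeddings)
print(kmeans.labels_)  # cluster id per document
```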
0 votes · 0 answers · 11 views
JAR files downloading very slowly in Jupyter notebook on a MacBook (M2 Pro)
Required JAR files downloading from the Maven repository in a Jupyter notebook are very slow on a MacBook (M2 Pro). How can I increase the download speed?
0 votes · 0 answers · 31 views
Multilabel Classification - Flat Binary Classifiers vs Hierarchical Binary Classifiers
I was researching multi-label classification to solve the problem of tagging news articles with topics and countries, where tags follow the syntax <topic>-<country>, and would like to ...
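A minimal sketch of the flat approach, assuming scikit-learn's one-vs-rest wrapper over TF-IDF features; the documents and <topic>-<country> tags are placeholders:
```python
# Flat multi-label setup: one independent binary classifier per tag.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MultiLabelBinarizer

docs = ["election results in france", "trade deal between us and china"]  # placeholder articles
tags = [["politics-france"], ["economy-us", "economy-china"]]              # placeholder tags

mlb = MultiLabelBinarizer()
y = mlb.fit_transform(tags)

clf = make_pipeline(TfidfVectorizer(), OneVsRestClassifier(LogisticRegression(max_iter=1000)))
clf.fit(docs, y)

pred = clf.predict(["french election coverage"])
print(mlb.inverse_transform(pred))  # predicted tag set per document
```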
0 votes · 1 answer · 22 views
Question about contextual embeddings?
How do BERT and RoBERTa generate contextual embeddings? The articles I've read keep saying that transformer encoders work bidirectionally. Because of self-attention, they can look at every token, ...
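A minimal sketch of what "contextual" means in practice, assuming the bert-base-uncased checkpoint: the same word receives a different vector depending on the sentence it appears in.
```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

def token_vector(sentence, word):
    # Return the last-hidden-state vector at the position of `word`.
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state[0]   # (seq_len, hidden_size)
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0].tolist())
    return hidden[tokens.index(word)]

v1 = token_vector("i sat by the river bank.", "bank")
v2 = token_vector("i deposited cash at the bank.", "bank")
print(torch.cosine_similarity(v1, v2, dim=0))  # below 1.0: same word, different contextual vectors
```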
0 votes · 0 answers · 71 views
Stream response from custom RASA actions to the chatbot
I am using Rasa Pro with CALM.
I was thinking of calling the OpenAI API within a custom action and streaming the response coming from OpenAI to my chatbot. OpenAI is giving me a streaming response and ...
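A minimal custom-action sketch, assuming the rasa_sdk Action interface and the OpenAI Python client's streaming mode; note that `dispatcher.utter_message` sends whole messages, so the chunks here are accumulated rather than truly streamed to the channel. The action name and model are placeholders.
```python
# Hypothetical custom action that consumes an OpenAI streaming response.
from openai import OpenAI
from rasa_sdk import Action, Tracker
from rasa_sdk.executor import CollectingDispatcher

class ActionStreamedAnswer(Action):
    def name(self) -> str:
        return "action_streamed_answer"

    def run(self, dispatcher: CollectingDispatcher, tracker: Tracker, domain: dict):
        client = OpenAI()  # reads OPENAI_API_KEY from the environment
        stream = client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "user", "content": tracker.latest_message.get("text", "")}],
            stream=True,
        )
        chunks = []
        for chunk in stream:
            delta = chunk.choices[0].delta.content
            if delta:
                chunks.append(delta)
        dispatcher.utter_message(text="".join(chunks))  # sent as one message once complete
        return []
```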
0 votes · 1 answer · 37 views
What's the purpose of using MLM when pretraining?
If BERT is a stack of transformer encoders, and the encoder already operates bidirectionally, understanding both left and right contexts and generating contextual embeddings, what is the purpose of ...
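A minimal illustration of what the MLM objective adds, assuming the Hugging Face fill-mask pipeline: bidirectional attention is the architecture, but masked-token prediction is the training signal that teaches the encoder to exploit both contexts.
```python
from transformers import pipeline

# The MLM head predicts the held-out token from its left and right context.
fill = pipeline("fill-mask", model="bert-base-uncased")
for pred in fill("The capital of France is [MASK]."):
    print(pred["token_str"], round(pred["score"], 3))  # candidate tokens with MLM scores
```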
0 votes · 1 answer · 37 views
How do transformer-based architectures generate contextual embeddings?
How do transformer-based architectures, such as RoBERTa, generate contextual embeddings? The issue is that I haven't found any articles that explain this process.
0 votes · 1 answer · 59 views
Fine-tuning, feature extraction, or both with RoBERTa?
I'm reading a program that uses the pre-trained RoBERTa model (roberta-base). The code first extracts word embeddings from each caption in the batch, using the last hidden state of the RoBERTa model. ...
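A minimal sketch of the distinction, assuming roberta-base: feature extraction freezes the encoder and only trains a head on top of its last hidden state, while fine-tuning leaves the encoder parameters trainable. The head and its output size are hypothetical.
```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
encoder = AutoModel.from_pretrained("roberta-base")

# Feature extraction: freeze RoBERTa and use its last hidden state as fixed features.
for p in encoder.parameters():
    p.requires_grad = False

head = torch.nn.Linear(encoder.config.hidden_size, 10)  # hypothetical task head with 10 classes

inputs = tokenizer(["a caption about a dog"], return_tensors="pt", padding=True)
features = encoder(**inputs).last_hidden_state[:, 0]    # first-token (<s>) embedding per caption
logits = head(features)

# Fine-tuning instead: skip the freezing loop so the encoder weights receive gradients too.
```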
0 votes · 0 answers · 9 views
Reducing language bias for text classification, transformer model
I am working on a text classification model that predicts classes for text. We have languages from many parts of the world, and some of our classes are dominated by specific languages. The model we are ...
0 votes · 0 answers · 128 views
RAG - how to deal with numerical data
I have data from car maker companies. I am creating chunks for different car models in LlamaIndex and using a vector store index, and it gives decent outputs when asked questions. It fails badly ...
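One common workaround is to keep numeric fields as structured metadata rather than burying them inside chunk text. A minimal sketch, assuming the llama_index (>= 0.10) core API; the field names, values, and query are made up:
```python
# Sketch: attach numeric fields as document metadata so they survive chunking and retrieval.
from llama_index.core import Document, VectorStoreIndex

docs = [
    Document(
        text="The Model X sedan has a 2.0L engine and seats five.",
        metadata={"maker": "ExampleCars", "model": "Model X", "price_usd": 32000, "horsepower": 190},
    ),
]

index = VectorStoreIndex.from_documents(docs)
query_engine = index.as_query_engine()
print(query_engine.query("How much horsepower does the Model X have?"))
```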
1 vote · 0 answers · 39 views
Training Models Directly with Transformer Attention Weights: A Viable Strategy?
I'm currently using pre-trained transformers to extract embeddings for sequence analysis, which are then used in downstream tasks. My process involves using the extracted embeddings as features for ...
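A minimal sketch of pulling attention weights alongside the embeddings, assuming a Hugging Face encoder called with `output_attentions=True`; whether such features help downstream is exactly the open question. The pooling at the end is one arbitrary choice, not a recommendation.
```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

inputs = tokenizer("a short example sequence", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_attentions=True)

embeddings = out.last_hidden_state        # (1, seq_len, hidden_size): the usual features
attentions = torch.stack(out.attentions)  # (num_layers, 1, num_heads, seq_len, seq_len)

# One simple way to turn attention into a fixed-size feature vector: average over batch, heads, and query positions.
attn_features = attentions.mean(dim=(1, 2, 3)).flatten()
print(embeddings.shape, attn_features.shape)
```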