@W-16485311: Core Logic for Selecting Records from Target Org #3818

Open
wants to merge 33 commits into base: main

Conversation

aditya-balachander
Contributor

aditya-balachander commented Aug 19, 2024

W-16485311

Select_Operation.mp4

aditya-balachander requested a review from a team as a code owner August 19, 2024 19:13
aditya-balachander added the labels cla:signed and internal (Pull requests from IPDE team) Aug 20, 2024
cumulusci/tasks/bulkdata/select_utils.py (outdated, resolved)
cumulusci/tasks/bulkdata/tests/test_step.py (outdated, resolved)
cumulusci/tasks/bulkdata/tests/test_step.py (outdated, resolved)
@jlantz
Contributor

jlantz commented Oct 1, 2024

Out of curiosity, would this feature enable selects against the User object? I have a current use case for querying User records and relating them to data in a dataset using the user alias field value. It would be really cool if dataset extract could do that mapping to user alias automatically :)

@aditya-balachander
Contributor Author

Hey @jlantz,

This tool is primarily designed to load data into a Salesforce org, not extract it. While I'm not sure what you mean about user aliases, you can theoretically add the "User" object to the mapping file and set its action to "select." This means that when you load data into the target org, the tool will query for existing user records in the target org instead of creating new ones. It will then link these users to other objects based on lookup relationships. Hope this helps :)

@aditya-balachander
Contributor Author

To further clarify, the goal is to enhance the tool by introducing a feature that allows users to select existing records in the target org. This would provide a way to reference these existing records when creating related child entities during the data load process.
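
The select-then-link flow described above can be sketched roughly as follows. This is a minimal illustration only, not the code in this PR: the function name `select_and_link`, the `local_id` key, matching on the `Alias` field, and the record IDs are all hypothetical placeholders.

```python
from typing import Dict, List, Optional


def select_and_link(
    target_records: List[Dict[str, str]],
    local_parents: List[Dict[str, str]],
    children: List[Dict[str, Optional[str]]],
    match_field: str,
    lookup_field: str,
) -> List[Dict[str, Optional[str]]]:
    """Resolve local parent rows to existing target-org records, then
    rewrite each child's lookup to point at the selected target record.

    Hypothetical helper for illustration; not part of CumulusCI.
    """
    # Index the records that already exist in the target org by the match field.
    target_by_key = {rec[match_field]: rec["Id"] for rec in target_records}

    # Map each local parent ID to the ID of the existing record it matches.
    id_map: Dict[str, str] = {}
    for parent in local_parents:
        target_id = target_by_key.get(parent[match_field])
        if target_id is not None:
            id_map[parent["local_id"]] = target_id

    # Copy each child row and swap its local lookup value for the target ID.
    # Children whose parent was not found get None for the lookup.
    linked = []
    for child in children:
        row = dict(child)
        row[lookup_field] = id_map.get(child[lookup_field])
        linked.append(row)
    return linked


# Placeholder data: two users already in the target org, one user in the
# local dataset, and one contact owned by that local user.
target_users = [
    {"Id": "005-TARGET-A", "Alias": "jdoe"},
    {"Id": "005-TARGET-B", "Alias": "asmith"},
]
local_users = [{"local_id": "User-1", "Alias": "asmith"}]
contacts = [{"LastName": "Nguyen", "OwnerId": "User-1"}]

linked = select_and_link(target_users, local_users, contacts, "Alias", "OwnerId")
```

Here nothing is inserted for `User`; the contact is simply loaded with its `OwnerId` pointing at the user that was selected from the target org.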

Comment on lines 5 to 9
import numpy as np
import pandas as pd
from annoy import AnnoyIndex
from sklearn.feature_extraction.text import HashingVectorizer
from sklearn.preprocessing import StandardScaler
Contributor Author

Hey @mjawadtp, I guess we need to also make changes in the requirements/dev.txt and requirements/prod.txt files to include these new imports.
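
For context on why text-vectorization and nearest-neighbor libraries might be needed here: this thread does not show how the imports are used, so the following is only a guess at the general idea, written with the standard library as a stand-in. It swaps in a hand-rolled hashing trick (in place of `HashingVectorizer`) and a brute-force cosine search (in place of an approximate index such as `AnnoyIndex`). All names below are hypothetical.

```python
import math
import zlib
from typing import List, Sequence

N_FEATURES = 64  # arbitrary vector width chosen for this sketch


def hash_vector(text: str, n_features: int = N_FEATURES) -> List[float]:
    """Hashing trick: count each lowercased token in a bucket chosen by its hash."""
    vec = [0.0] * n_features
    for token in text.lower().split():
        # crc32 is deterministic across runs, unlike the built-in hash() for str.
        bucket = zlib.crc32(token.encode("utf-8")) % n_features
        vec[bucket] += 1.0
    return vec


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def most_similar(query: str, candidates: Sequence[str]) -> int:
    """Return the index of the candidate closest to the query (brute force)."""
    if not candidates:
        raise ValueError("candidates must not be empty")
    query_vec = hash_vector(query)
    scores = [cosine(query_vec, hash_vector(c)) for c in candidates]
    # max() keeps the first index on ties.
    return max(range(len(candidates)), key=scores.__getitem__)


# Placeholder record names standing in for rows queried from a target org.
existing_names = ["Acme Corporation", "Globex Industries", "Initech Software"]
best_index = most_similar("acme corporation", existing_names)
```

A brute-force scan like this is linear in the number of candidates; an approximate nearest-neighbor index trades some accuracy for speed on large record sets, which is presumably the motivation for a dedicated library.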

Contributor Author

Discuss with James

cumulusci/tasks/bulkdata/select_utils.py (outdated, resolved)
Labels
cla:signed internal Pull requests from IPDE team
3 participants