100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached
logo-home
BIDA 630 Data Analytics TEST (Graded A+ actual test) $8.49   Add to cart

Exam (elaborations)

BIDA 630 Data Analytics TEST (Graded A+ actual test)

 4 views  0 purchase
  • Course
  • BIDA 630 Data Analytics
  • Institution
  • BIDA 630 Data Analytics

_____________ of data is used to assess the performance of each supervised learning model so that we can compare models and pick the best one. - The test partition - The validation partition - ️️Validation The validation partition is used to assess the performance of each supervised learnin...

[Show more]

Preview 2 out of 11  pages

  • September 15, 2024
  • 11
  • 2024/2025
  • Exam (elaborations)
  • Questions & answers
  • BIDA 630 Data Analytics
  • BIDA 630 Data Analytics
avatar-seller
PatrickKaylian
BIDA 630 Data Analytics
_____________ of data is used to assess the performance of each supervised learning
model so that we can compare models and pick the best one.

- The test partition
- The validation partition - ✔️✔️Validation

The validation partition is used to assess the performance of each supervised learning
model so that we can compare models and pick the best one. In some algorithms (e.g.,
classification and regression trees, k-nearest neighbors) the validation partition may be
used in automated fashion to tune and improve the model. This means that the
validation data are actually used to help build the model.


This is unsupervised learning, if we assume that we do not know what will be purchased
in the future.

The test data are used to build models, or to further tweak the model or improve its fit.

- True
- False - ✔️✔️False

The test data are not used to build models, or to further tweak the model or improve its
fit. (If the test data were used for these purposes, they would play a role in building or
selecting the best model, and would no longer provide an unbiased assessment of the
chosen model's performance with completely new data.)


When a model is fit to training data, zero error with those data is not necessarily good.
This special case is called ______.

- Overestimating
- Good fit
- Overfitting - ✔️✔️Overfitting

Overfitting occurs when the model captures not only the generalizeable pattern in the
data, but also the error. When we split the data into training and validation sets, we
assume that the same pattern (if there is a pattern) exists in both, and that they differ
only in the error that they contain. An absurd and false model may fit perfectly (on
training data set) if the model has enough complexity. Therefore, we may get zero error
for such a model using the training dataset. Such a model, however, is not likely to give
useful results on the validation data set.

, Bar charts are useful for comparing a single statistic (e.g. average, count, percentage)
across groups. The height of the bar represents the value of statistic, and different bars
correspond to different groups.

- True
- False - ✔️✔️True

Which of the following are the most popular visualization tools in JMP_Pro? (3 correct
answers)

- Distribution
- Fit Y by X
- Graph Builder
- Data visualizer
- Graph wizard - ✔️✔️- Distribution
- Fit Y by X
- Graph Builder

Scatter plots play important role in prediction. Next step can be developing a model.
Scatter plots provide information about relationships (linear or non-linear) between
variables. The variables in scatter plot ________.

- can be nominal
- must be numerical
- can be both numerical and categorical
- must be ordinal - ✔️✔️- must be numerical

In a box plot, the box include %50 of the data, the horizontal line represents
(i)____________, the top and bottom of the box represent (ii)________, respectively.

- (i) the mean, (ii) 75th and 25th percentiles
- (i) the mean, (ii) 10th and 90th percentiles
- (i) the median (50th percentile), (ii) bounds for outliers
- (i) the median (50th percentile), (ii) 75th and 25th percentiles - ✔️✔️- (i) the median
(50th percentile), (ii) 75th and 25th percentiles

In JMP a diamond is displayed in the box, where the center of the diamond is
_________.

- The median
- The mean
- The skewness value
- The halfway between outliers - ✔️✔️- The mean

The benefits of buying summaries with Stuvia:

Guaranteed quality through customer reviews

Guaranteed quality through customer reviews

Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.

Quick and easy check-out

Quick and easy check-out

You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.

Focus on what matters

Focus on what matters

Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!

Frequently asked questions

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

Satisfaction guarantee: how does it work?

Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.

Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller PatrickKaylian. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy these notes for $8.49. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews)

76658 documents were sold in the last 30 days

Founded in 2010, the go-to place to buy study notes for 14 years now

Start selling
$8.49
  • (0)
  Add to cart